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REACTION-TIME STUDIES: THE ANTICIPATION AND = E қ 
INTERACTION OF RESPONSES P A AAT) 


By E. A. C. Тномав 
St. John's College, Cambridge 


The present study is largely concerned with the relation between reaction 
time and the temporal uncertainty of the signal. It is suggested that the 
appropriate measure of signal uncertainty is the conditional probability that the 
signal would be presented given that it has not been presented before. ‘This 
conditional probability is called the expectancy function. It is assumed that the 
expectancy function influences the subject's state of readiness and this in turn 
influences reaction time; and ways of expressing these relations mathematically 
are discussed. It is shown that, with a suitable representation of subjective errors 
in time and probability estimation, a theory can be stated which accounts for the 
qualitative features of a variety of experimental data. 

The relevance of this theory to studies concerned with central intermittency 
is then discussed. It is argued that temporal uncertainty alone is insufficient to 
account for the increase in reaction time to the second of two closely spaced 


f signals, both of which require a response. On the other hand, a model for the 
1 interaction of two responses is stated in which refractoriness in the central 
" 


processes is not inevitable. 


1. INTRODUCTION E 

With the growing number of attempts by psychologists to construct a E M À 
otor performance has developed a revival of ГЕ 
interest іп reaction-time studies. In these experiments the subject is told to 
expect a signal (usually a flash of light or a pure tone well above threshold 
intensity), and to respond (usually by pressing a key) as quickly as possible after 
perceiving the signal. ‘The reaction time is the time between the presentation 
of the signal and the making of the response. At every trial the subject is given 
a warning signal, and after a variable foreperiod the signal is presented. 

There is a classification of reaction-time experiments based on the number 
of alternatives from which the signal may be chosen. In the simple reaction- 
time task the subject knows what the signal is going to be and his only uncertainty 
is as to when it will be presented. In the disjunctive situation the signal can be 
one of a set of alternatives, thus adding another source of uncertainty. Further, 
more than one signal of the set may correspond to the same response or more 
than one response to the same signal. In the serial reaction task two signals are 
presented in succession at each trial and the subject has to respond to both. 
Here both the foreperiod and the inter-signal interval are variable but are small 
compared with the inter-trial interval. Studies in this group have been con- 


cerned mainly with central intermittency, and their relationship to those on 
S.P. 
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simple and disjunctive reaction times seems far from clear. For a review of 
some of the issues involved the reader is referred to the article by Bertelson (1966). 

Тһе main aim of this paper is to discuss possible mathematical models of 
the relation between subjective readiness and simple reaction time. Studies on 
central intermittency will then be discussed in the light of these models. The 
next section is a review of some of the large body of experimental data which 
provided a starting point for the present study; and in later sections more data 
will be introduced to illustrate Specific points. The following abbreviations will 
be used: S refers to the subject, X to the signal, R to the response, RT to the 
average reaction time and SR to the subject's state of readiness. These abbrevi- 
ations will sometimes appear with suffixes, but the meanings should be clear. 


2. Some EXPERIMENTAL RESULTS 


In all the experiments reported below, the readings used were taken after 
Ss had had extensive Practice in the particular condition, Klemmer (1956) 
investigated the relation between foreperiod variability and RT by using two 
kinds of tests: Interval and Foreperiod tests. In the first kind of test a typical 
Series consisted of X being presented at regular intervals of length т, which differed 
among tests. These tests were later repeated by Karlin (1959) and, within the 
range 0-5 sec to 12 sec, it was found that RT increased with + and the slope of 
the curve decreased with т, i.e. the curve was concave, 

In a typical series of the second kind, the foreperiod at a trial was drawn 
from an eleven-point rectangular frequency distribution on (s,s+h). s, the 
minimum foreperiod, and h, the foreperiod range, were different for different 
series. "Тһе two statistics of interest were the RT to 
a particular interval, and the mean RT to all signals in a series. It was found 
al, but that the latter was an 
increasing function of h, though it showed little change when s was varied from 
1:25 to 7.25 sec, h being held constant Klemmer concluded that * S’s time 
uncertainty is the result of both his own imperfect time-keeping ability and the 
clock-time variability of the stimulus’. Тһе first factor is perhaps reflected by 


the RT-7 curves, and the second by the dependence on h of the mean RT for а 
Series, 


І 4 uncertainty two more kinds of 
experimental series were used (Klemmer, 1957). The first kind was the same 
‚Ше F i hat a normal frequency dis- 
tribution with mean ш and standard deviation (s.d.) о was used. To each 
series of this kind there was a corresponding series of the second kind, the 
Prediction tests, in which 5 was asked at each trial to estimate an interval of 
length и. S received immediate knowledge of the magnitude and direction of 
his error of prediction. The variance of the prediction times was taken as à 
measure of 5% uncertainty in estimating an interval of length и. The total 
time uncertainty, c2. for the Particular Foreperiod series (ш, о) was found by 
adding this Prediction variance to o. Klemmer found that RT was linearly 
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related to log ор. It should be noted that if there is no subjective time 
uncertainty, then 
log c, —log с 
— H — constant, 
where H is the Shannon information measure for the normal distribution. So, 
from the result that RT is linearly related to H (Hick, 1952), one might regard 
log c, as the ‘information’ to the subject whose time-keeping is imperfect. 

The four values of ш used in the Prediction tests were 2? sec, n=0, 1, 2 
and 3. Let the observed s.d. be denoted by о). Ап inspection of the results 
for the 5 Ss (Klemmer, 1957, Figure 1) shows that, at least approximately, 

log; o; —log, oy. ,—c, a constant. 
On=2¢ Oni 
= (20) oÍ = (20) ko 
= (25-1) kọ. 
Therefore, the s.d., о(т), of S’s estimation of ап interval of length т is approxi- 
mately given by 
о(т) = т, (2.1) 
where ky and c are constants. For опе S, с= 1-1, and for the other four 
Ss, c= 0:8. 

More recent work on Foreperiod tests has revealed a significant relation 
between RT and its immediate foreperiod (Davis, 1959; Karlin, 1959; Drazin, 
1961). Davis found that RT decreases linearly as immediate foreperiod 
increases from 0-05 sec to about 0:25 sec, after which RT' remains constant. 
Drazin used a continuous rectangular foreperiod distribution and has shown 
that, generally, the RT-foreperiod curve is decreasing with decreasing slope. 
It was also shown that the RT at the minimum foreperiod increases as s decreases 
from 2 sec to 0-125 вес. T'his suggests that there is a critical value of s below 
which S cannot make full use of the warning signal (see also Klemmer, 1956). 
Deviations from linearity at the smallest foreperiods are also evident in Davis 
(1959). 
In another experiment (Mowbray, 1960), which falls somewhere between 
the Foreperiod and Interval tests, S was required to respond only to a particular 
signal out of a set of n signals. At each trial each signal was equally likely to 
be presented, the presentation rate being fixed at one signal every 2:5 вес. It 
was found that RT increased linearly with z in the range 2&z «10. 

In some experimental series there was a fixed proportion of ‘ catch’ trials 
in which no signal appeared after the warning signal (Davis, 1956; Drazin, 1961). 
One reason for including them was to check whether S had a tendency to 
anticipate his reaction. Whenever, and only when, this was done, there was a 
tendency for the minimum AT to occur at an interval less than the maximum. 
The other comparable experiment in which this effect appeared was that of 
Elithorn and Lawrence (1955), and in this case there were trials on which only 
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опе signal was presented though S was warned to expect the omission. Mowrer 
(1940), using a different type of ‘ catch’ trial, observed an effect to which the 
above effect has been compared, somewhat misleadingly. Mowrer trained his 
Ss to expect a signal after a fixed interval and then presented them with catch 
trials at which the signal came at intervals different from the fixed interval. He 
found that the RT for the catch trials was greater than the RT for the fixed interval 
trials. On the basis of this finding, Davis (1956) has suggested that ‘ where 
several intervals are used in the training situation the subject reacts most quickly 

(a) at the most frequently presented interval, 

(b) at the interval in the middle of the range.’ : 

Still more recent work (Davis, 1965) has attempted to show that the relation 
between RT and foreperiod is insignificant. In this experiment the warning 
signal at each trial was a response made by S whenever he chose. Karlin (1965) 
has already pointed out that this experiment is inconclusive because, in subjective 
terms, the interval between an ad lib. response and a subsequent signal may not 
be equivalent to the interval between two signals. ‘This argument should 
become clearer later on. Other work (Bertelson and Boons, 1960; Kay and 
Weiss, 1961; Borger, 1963), though not concerned specifically with the RI- 
foreperiod relation, has tended to be consistent with the above-mentioned work 
and will be referred to later on in the paper. 


3. RT, SR AND THE ExPECTANCY FUNCTION 


The results quoted above suggest that when S knows what signal to expect 
but not when it will arrive, there are at least two relevant processes occurring 
іп time. The first describes the time-course of S's state of readiness (SR) UP 
to the time that X arrives, and is perhaps related to the objective stimulus 
conditions. The second process, dependent on the first, starts when > arrives 
and ends with the making of a response, and involves the recognition of У as 
the signal requiring a response and the organization of the response. For the 
present, no attempt will be made to specify the way in which this second process 
evolves, and attention will be confined to the possible relation of its length, 
ie. RT, to the endpoint of the first process. 

One of the first requirements of this approach is to specify the independent 
variable of which SR is a function. This variable would itself be a function of 
time (here assumed discrete) and the experimental conditions and, at every 
time 2, should measure the signal uncertainty at ¢ conditional on the process up 
to ¢ A natural measure of this sort is Pa the conditional probability that 
would arrive at £ given that it has not arrived before 2, and this is the measure 
which will be assumed to be relevant. It will be called the expectancy function. 
| The next step is to characterize SR and then to make assumptions about 
its dependence on р. Two ways of specifying SR might be by measuring levels 
of muscular tension or by measuring the electrical activity in the cortex. As far 
as the author is aware, more attention has been paid recently to the cortical 
correlates of stimulus uncertainty than to the muscular. For this reason an 
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because these two measurements are probably highly correlated, we shall 
consider the experimental findings concerning only cortical activity. In their 
study of waveforms evoked by auditory and visual stimuli, Walter et al. (1964) 
have shown that expectancy can correspond to the development of a large 
negative component in the waveform; this component is called the contingent 
negative variation (CNV). When S is told that a warning signal would be 
followed by another signal requiring a response, a large CNV appears rising to a 
peak just after the second signal is presented and then ending abruptly. They 
found that the introduction of catch trials tended to reduce the CNV (mixing 
27 catch trials randomly in a series of 48 reduced the CNV by about a half), 
though for most Ss the CNV showed signs of attenuation only when the 
proportion of catch trials exceeded a critical value (about 0:3). It was also 
found that in most Ss the development of a consistent CNV with an abrupt 
ending was associated with a marked reduction in RT to the signal. 

These results suggest, firstly, that it is possible to characterize SR such that 
it increases as py increases up to some value, фу, beyond which it is constant. 
The weak assumption will be made that SR is measured in units which make it 
directly proportional to p; for pr<px. То consider the effect on SR of time- 
keeping errors it will be assumed that S's error in estimating a time-point at 
which X may arrive is small, and that his counting off of successive time-points 
is perfect. The assumption of small time-keeping errors seems reasonable in 
view of the results of Woodrow (1930) showing that subjective estimates of time 
intervals in the range 0-2 sec to 1-5 sec had а standard deviation which was 
approximately 10 per cent of the standard interval. These assumptions enable 
us to proceed after specifying the SR only in a small, continuous neighbourhood 
of the time at which S expects X. Thus if T is S's estimate of t, then, in the 
absence of a signal, SR would rise to a local maximum proportional to р; at T 


Ж 
1 
1 
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Ficure 1. Time-course of subjective readiness (SR) when foreperiod distribution is 
discrete; e; is S’s error in estimating the ith time-point. 


and then decline. Let it be assumed that, for z in a continuous neighbourhood 
of T, SR has the expansion, 
SR(z) 2 lp т Т|-т|а-Тр-.... 
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If |z — T| is small, then, as an approximation, 
SR(z)-9 1р. m|z T], (3.1) 


where | and т are positive constants. his situation is shown in Figure 1. 
Secondly, we may suppose that RT decreases as SR increases. бо the RT 
to Z which comes at t is given by 


RT,-f[SRO) | (0<pe<pz) ; 
= РТ (b:2 2) H (3.2) 


where f(.) is a decreasing function of its argument and RT, denotes the minimum 
RT. Three simple forms of f(.) are the following: 


b 
Ла-а%-; (а) 
=a—bx; (b) (3.3) 
=a—b log x. (c) 


There is little evidence to help in the choice of f(.), but it will now be shown 
that the RT curves given by eqn. (3.2), where /(.) is given by (3.3 a), give a 
good fit to the data of Davis (1957, 1959) and Mowbray (1960). 

In the experiments of Davis the foreperiod distribution was uniform 00 
the integers 1 to n. Thus, by Bayes’ rule, 


= 1|п 
Руа 
-(а-і-І)і, 4-1,2,.... (3.4) 


If f(.) is given by eqn. (3.3 a), then for the ideal S whose timekeeping is perfect 
it follows from (3.1) and (3.2) that 
ЕТ,-а -51 l<t<j 
= КТ, j«t&n 


for some j. This linear relation is in accord with the experimental results. In 
the experiment of Mowbray there were » possible signals and the probability 
that a particular one would be presented at a given trial was, therefore, 1а. 
So that, for the ideal S, if (3.3 a) holds, the RT to the particular X when there 
аге п possible ones would be given by 


RT(n) =a" +b'n. 
This linear increase of RT(n) with z was in fact observed. 

Thus it is concluded that the expression (3.3 a) for f(.) is not unreasonable. 
Further tests of its adequacy can be carried out by comparing observed and 
expected RT curves using different foreperiod distributions. However, (3.3 а) 
seems simpler to handle mathematically and will be adopted here mainly (0 


_- m 
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provide a theoretical framework within which certain ideas on expectancy can 
be discussed. The discussion will be limited to two qualitative aspects of 
RT-foreperiod curves when the foreperiod distribution is uniform: the sign of 
the slope and that of the rate of change of the slope of these curves. Following 
a convention, we shall say that a curve is convex (concave) when this latter sign 
is positive (negative). 

First to be discussed are the Interval tests where there is no temporal 
uncertainty, so that S’s only error is in estimating the interval between signals. 
It is assumed that S's estimate of a time interval is equal to the true interval 
length plus an error term. This linearity assumption about subjective time 
estimates is retained in the discussion of the discrete Foreperiod tests. It is 
shown that this way of representing time-keeping errors provides a means of 
accounting for certain features of empirical RT curves. In the treatment of 
the continuous Foreperiod tests no explicit mention of time-keeping errors will 
be made and it is assumed only that 5 adjusts his SR intermittently. In Section 
5 an alternative mathematical treatment of the problem will be considered. 


4. MATHEMATICAL MODELS 


4.1 The Interval tests 
In these experiments X is presented after a fixed foreperiod, 7, and the 


effect on RT(r) of S’s time-keeping imperfection has to be considered. It is 
supposed, following eqn. (3.1), that SR rises linearly with gradient m so as to 
reach a maximum, /, at the point in time, Т, at which S expects X. Неге, 
however, it seems reasonable to suppose that if = does not arrive by T, this 
maximum is maintained until © does arrive (see Figure 2). If we write 
T=7+e, then e is the prediction error studied by Klemmer (1957). 


SR 


i 


- -4|----- 


3 
84 


Ficure 2. ‘Time-course of SR when foreperiod is a constant, т. 


S's RT would be lengthened whenever «>0, because in this case the 
not have been attained by the time т. Let E denote 


maximum SR would 
Then from eqns. (3.1), (3.2) and (3.3 a) it follows 


* mathematical expectation ’. 
that RT(z) is proportional to 


1 1 
+ (es) E| — leo | 4- constant, 
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If it is assumed that е is normally distributed with mean 0 and s.d. о, then 
ple<0)=1/2, 


and the probability that x< e « x-- dy given that e> 0 is 


at (—x?/207) dx, х>0. 


E(eļe> 0) = : 4 с ) Г x exp (202) dx 


So that, neglecting terms of order e? and higher, 


1 21 те 
Feo] 7 reli "F е>0| 


740] 


КТ(т) =a) +4, o(7), where ap а,> 0 and, from eqn. (2.1), о(т)- r^ 


d 
2 RT(7) = cakro » 0, 
and 
d? 
ja RT (zr) = —с(1—с)а,дуто—?. 


Hence if 0<c<1, an assumption suggested by the results of Klemmer (1957), 
the second derivative is negative. In other words, for small т, the curve RT() 
is a concave, increasing function of z. For larger values of 7 the term containing 
є needs to be considered. Then, since Е(е?|є > 0) = 0, 


2 
s ЕТ(т)= Hre В(1—с)], 


ИИ ПРЕ . ис, 
where A and В are positive constants, which is negative for 7< [B(1— c)] 


This result seems to be consistent with the curves obtained by Klemmer (1956) 
and Karlin (1959). 


4.2. The Foreperiod tests, discrete case è 
It is supposed that in these experiments X arrives at clock-time td, m 
d (secs) is a constant and t is chosen at random ч 


from the integers 1, 2,..9 
Then as pointed out in Section 3, the RT curve 


is the following: 
RT, =a'—b't Ixt«j 


= КТ, j«t«n 
for some j, 
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S's time-keeping imperfection can be accounted for in the following manner. 
Suppose that X arrives at time id; then S has had to ‘predict’ each of the time- 
points id, t=1,2,...,7 It is assumed that 5 predicts one time-point 
starting from the previous one, so that for each prediction S predicts an interval 
of length d and does so with an error e. It is also assumed that е is N(0, o°), 
where o<d. Then the error, e; in predicting the interval of length id is 
N(0, 202). The SR curve for this situation is shown in Figure 1. 


1 | b 
ШЫП ТЕПТІ ЕГЕСІ 


2 
-acb[(n—ic-1)3—g4/1]3, where ТЕЕ )] «1, 
T 


=a+b(n—it+1)+bg(n—i+1)*/2, (4.2.1) 


neglecting terms of order 2? and higher. 
The following restriction on g is imposed: 


(n—i+1)-*-gy/i>0 for 7=1,2,..., п. 
This is satisfied if 
/3 
g< ans (n+ 1)792. (4.2.2) 


Plot of [(m—i--1)? —g4/i]^ against i, for n—10, 


FIGURE 3.  RT-foreperiod curves. 
Ficure 3 orep POR, Gi 
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"Treating i as continuous, it follows from eqn. (4.2.1) that 


CERT Ogi ass а LASS 
3: x [152 — 6(n + 1) — (n + 1)2], 
which is negative for ¿< 7-9(a 4-1)/15 approximately, and positive for the rest 
of the range. In other words, the curve is concave for about the first half of 
the range and convex for the other. 
It is easy to show that (4.2.2) implies that ЕТ,<ЕТ, , However, 
RT,» RT, if 


(n— 1)? -g4/2 « i1 — g, 
ie. if £»[n(a - 1)(/(2) 21]. (4.2.3) 
It can be seen that (4.2.2) and (4.2.3) are compatible only if 24. So that if 
n> 4 itis possible that the maximum RT occurs not at the shortest foreperiod but 
at a longer one (see Figure 3). An effect of this kind has been observed by 
Elithorn and Lawrence (1955) and by Klemmer (1956). 


4.3. The inclusion of ° catch’ trials 


Consideration will now be given to a foreperiod series with a uniform 
frequency distribution of foreperiod in which a proportion 1 — т of catch trials 
is included. Then 


prob. (X arrives at 2) 


m prob. (X arrives at ¢ or after) 
m [n 
= Tato tO. (4.3.1) 


So that for the ideal 5, for some + 
ВТ,-а”-ы Ixt«j 
RIS j<t<n. 
This linear decrease is contrary to the experimental results of Davis (1956) and 
Drazin (1961), who found that RT; reached a minimum for some t <n and then 
increased for larger f, The introduction of time-keeping errors would not 
account for this feature since, as was shown in the previous section, RT is less 
than RT, ay provided the errors are small and л> 4. ia] 
In deriving eqn. (4.3.1) it was assumed that the subjective belief that a s 
is not a catch trial is unchanged while S waits forX. However, it is likely e" 
this subjective belief decreases while S waits for X and this idea is now developec. 
The 7 in (4.3.1) is replaced by z, the subjective probability at time 1 that 
the trial is not a catch trial; and the substitution оң = zr lis made. "The turning 
point (minimum) of pr is required. From (4.3.1); 
Pit pis “о-ө, )-1 
=0 when 
9t— 0t 1—]1[n. (4.3.2) 
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It remains to make assumptions about the form of w. The following three 
assumptions seem acceptable on intuitive grounds: 
(i) o= 771, e — ot 42 0, i.e. S’s belief at the start of a trial that > would 
be presented matches the objective probability, and then declines. 
(ii) e; —1 for all ? when т= 1, i.e. if no catch trials are included, S does 
not waver in his belief that X would be presented while waiting for it. 
(iii) The rate at which S loses his belief is small initially and then increases, 
ie. тү is concave. This implies that оу is convex. 
A fourth assumption is suggested by the empirical finding that RT,» КЕТ, 
which implies that 5,7! > рат, and hence 
(iv) wn< «7! - (1— 1/n). 
That these four assumptions do not guarantee that (4.3.2) will always have 
a solution 0( < л) can be seen by putting 
cog — 77 4 (1—1/n) — 8, 0« 8«(1—1/n) 
tt — 9t 37213 — 8[(n — 1). 
For then all four conditions are satisfied, but 
wi— ot 4«l[n for 1<t<n. 
On the other hand, if (4.3.2) has a solution < n, then 
cot — ox, 47 1 [n for all t > 0. 
Or, to put it crudely, for (4.3.2) to have a solution the convexity of ш, must not 
be too small. The two types of expressions for «x that will be considered аге: 
(a) a polynomial of the form 


ups DE жі, (4.3.3) 
j=0 
where the c’s are positive; and 
(b) the exponential expression 
а= d + dimt, d? 0. (4.3.4) 
Eqn. (4.3.3) satisfies conditions (i) to (iii) if the c's are given by 
соз Tk, 
сұт т" -1- п“, ДЕЙ evt 


where —1=a)<0,<-..<ay. For simplicity it will be assumed that —1= а 
=ap <ae=a. Eqn. (43.4) also satisfies these three conditions 


=a =... 


provided 
тф-1-а,. 


It will now be shown that solutions, 0, of eqn. (4.3.2) such that 0< аге 
possible, but that all these solutions exceed (7+1)/2. The following results 


will be used: 
пк (п = pen > 2nk—1; (4.3.5) 


(n+ 1)Ë—(n—1)k < 2&(nF — 1) (n — 1); (4.3.6) 


for n>3, kz2. 
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The sequences {ил} and {vy} are decreasing, where 


иа-тт!-1-(п-1)т", (4.3.7) 
апа 


v4 —(1— 77)/n; (4.3.8) 


for т<1. | 
The inequalities (4.3.5) and (4.3.6) can be established by induction on Ё. 
For, putting k=2 in (4.3.5), it follows that 
n? —(n—1)8=2n?-—1+(n—1)(n—2) > 2n?-1. 
If (4.3.5) holds for k, then 
пЁ+® — (n — 1) = nnk H — (n — 1592 
»n[2n* + (n — 1) —1] 2 (n — 1)**? 
=2nk+1—1+(n—1)[(n—1)F-1] 
>2nk+1— 1, 
so that it holds for k+1 and, therefore, for all k. The proofs of (4.3.6) to 
(4.3.8) are omitted. 
A theorem about the o, will now be proved. 
THEOREM 4.3 


If w, given by eqn. (4.3.3) or (4.3.4), satisfies conditions (i) to (iv), then 
(а) solutions, 0, of (4.3.2) such that 1<@<n are possible, and 
(b) if a solution exists, then 0 > (n + 1)/2. 


PROOF 
Case (1). w= та Tr —mq) kz2. 
Condition (iv) implies that 
[n( 771 — z°)] > (nk — 1)( — 1)2. 
From (4.3.2), 
[n(m — 23)]-1 = 0 — (8 — 1)к, 

from which it can be seen that 0 cannot be less than n unless 

nk — (n — 1) > (nk — 1)(n— 1) or 

nk+1— (п — 1)+1 > 25k — 1, 
which from (4.3.5) is true. So that a solution 0<n is possible. 

То show that, if a solution exists, 0 > (n + 1)/2, it is necessary only to show 


n+1\F /п—1\® 
22 - (>) < (п# — 1)(п—1)-1, 
which is the inequality (4.3.6). 


Case (2). we=(1—d)n 4dr, 
Condition (iv) implies that 


that 


d< s? (n — l)[n(1 — 5212, 
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From (4.3.2), 
а= z=°[n(1 — z>)] *, 
so that 1 < 0 «2 only if 
e? [n(1 — r) < (un — 1)[n(1 — gy < z[n(1 — z)] - 

The left-hand inequality reduces to 

1—т>(1—т"—!)(п— 1), or 

9, 7 0, ,, Which follows from (4.3.8). 
The right-hand inequality reduces to 

(n-1)0—7)77? «1— тй-1) or 

ng-!—(n—1)z" <1, or 

иң <14, which follows from (4.3.7). 


Therefore a solution 0 < » is possible. 
If a solution exists then 


a! [n(1 — z)]| « z"(n— lpa1-5"7)] or 
79> (1 т") (п – 1) түрі. 
"Го show that 0> (n + 1)/2 it has to be shown that 
т(а-1)/® <(1 — т"-ї)[п—1)(1—т)] 1, or 
жа)-і-т<(т-"- zm) /2m=2(7); 


where m=(n—1)/2>1. 


But 
dz КЕ ау 
їт|т=1 | 7 da|a=1' 
and 
Фе " š Фу 
i387 тез [m(1 т") + 14 т?т] > 0 = PE 


:.а(т)>у(тл) for 0< z «1l, 
which completes the proof of the theorem. 

'l'he above theorem gains some generality if one accepts that any convex, 
increasing function, c, can be fairly well approximated in the range 1<i<n 
by a curve cither of the form (4.3.3) or of the form (4.3.4). The theorem then 
leads us to expect that when catch trials are included, the minimum RT would 
occur at a point greater than the midpoint of the range. This result seems to 
be at variance with the suggestion (Davis, 1956) that maximum preparedness, 
and hence minimum R7, occurs at about the midpoint of the range. 

0 can be obtained from the experimental data, and (4.3.2) can then be 
solved for а or d. It would then be possible, using this estimate of « or d, to 
estimate the value of t( < 0) for which RT, is approximately equal to RT». This 
value of t can then be compared with the observed value. The parameters о 
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and d are thought of as being possibly dependent on 5 but independent of z. 
These predictions, together with that of Theorem 3.4, can be regarded as tests 
of the adequacy of the above approach. 


4.4 The Foreperiod tests, continuous case 


In Section 4.2 it was implicit that S was aware of the discrete nature of the 
foreperiod distribution so that he was able to build up his SR to reach local 
maxima at the distribution points. When the foreperiod distribution is sub- 
jectively continuous (assumed uniform on the interval (s,s+h)), there are no 
such preferred time-points at which SR should be maximized. Похусхег, one 
can assume that, as in the discrete case, SR is adjusted at discrete points, but, 
since the distribution is now continuous, it is necessary to specify SR for all 
t,s<t<s+h. Let it be assumed that S adjusts his SR at instants s, $-- p 
еке 50)... SEU HL. »+wy=s+h, апа that in the wth interval of length 
Wn his SR is proportional to the conditional probability, p, that X would arrive 
in that interval. Then 

Pn=Wn|(Wnt+ Wn, + we +оо). 
Let tp be the midpoint of the nth interval, i.e. 

th=S+W,+... +Wn_1+ (0/2), 
and let RT, denote the RT when X arrives in the nth interval. By the RT- 
foreperiod curve will be meant the plot of RT, against ty. 

Almost any shape of this curve can be explained by making appropriate 
assumptions about the finite sequence {wn}. However, consideration will be 


restricted to sequences of the type “а-а, ,. Then the following theorem 
can be proved. 


THEOREM 44 


If v, = o0, уу then the RT-foreperiod curve is decreasing and is convex 
if and only if x > 1. 


PROOF 


(i) If а=1, it is easily seen that this case is similar to the discrete case 
considered at the beginning of Section 4.2 where it was shown that 
RT,—a,—b. 


(П) «<1; = а. 


П-еә(1-а5(1- gt, 
and 


Vn... тш, -а(ал-1-- anyi =}, 


із-(- ®)(1—хУ—п+ Der 
and 


RT, —a44 by(1— a^- т41), 
Let Anf denote fnji— fn. Then, 


An at = (wn +0) [2, = ал-?% (1 + a)/2, 
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and 
An_,RT 
NP 
AnS = —2byo*-22+1(1 — a)?[eo, < 0. 
That is the RT-foreperiod curve is concave, decreasing. 
(iii) «»1; Әп-(а-1)аУ-а1-1)-і; 


n= — 2b, 9 3(1 — a) [ves (1 + a)] « 0, 


and 
An_t=a"—*w,(1 + a)/2. 
Here it can be seen that 
8, = —2b,aN-21*9( — T) [es (1 + a)] < 0, 
and 
An = 2b, «N21 (a — 1)2/z0, > 0, 

ie. the RT-foreperiod curve is convex, decreasing. This proves the theorem. 

The convex shapes obtained by Drazin (1961) suggest that a value of 
«z1 would be appropriate in this type of model. This would mean that if 
S adjusts his SR intermittently as he waits for X, then his rate of adjustments 


decreases as he waits. 


5. AN ALTERNATIVE TREATMENT USING THE ExPECTANCY FUNCTION 


In this section it will be assumed that the foreperiod distribution is con- 
tinuous; it will be seen that analogous results hold for the discrete case. Let 
f*(t)dt be the probability that У arrives in the infinitesimal interval (t, t+ dt). 
Then, as in Section 3, the expectancy function is defined by the relation 


f*(0) қ І бош 
#(t)= ———— , where К%()= u)du. 
P= arg Q-[ rw 
This mathematical expression is known as the hazard function in reliability 
theory, as the intensity function in extreme value theory and as the force of 
mortality in actuarial work. It has already been introduced in reaction-time 
studies by Audley (1963) as a measure of the theoretical expectancy of an event 


at any point in the distribution. 
In Section 4 we assumed that S’s error was in time-keeping, but Audley 


(personal communication) has pointed out that subjective probability distribu- 
tions can be significantly different from the corresponding objective ones and 
some consideration should be given to this fact. Let f(t) be the subjective 
estimate of f*(t), and let p(t) be the corresponding subjective expectancy function. 
In accordance with the arguments outlined in Section 3 it will be assumed that 
the RT to a signal presented in (1, 1+) is an increasing function of 1/p(é). 
Then the shapes of the empirical RT curves suggest that, in general, p(¢) is an 
increasing function of t. Two exceptions are the case when p'(t) s dp(t) [dt is 
negative for small 7 and then becomes positive (Elithorn and Lawrence, 1955; 
Klemmer, 1956), and that when р'(ї) is positive up to some point and is negative 
towards the end of the (finite) foreperiod range (Drazin, 1961). "Тһе question 
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that will concern us here is under what mathematical conditions on f(t) can 
these two special cases arise. 

Properties of monotone p(t) have been studied by Barlow et al. (1963), who 
showed that if the distribution range is finite, (0, 2) say, then 

lim p(t) = co, 

th 
which implies that p(t) cannot be decreasing near л. In other words, a necessary 
condition for (t) to be decreasing at the end of the foreperiod range is that the 
total subjective probability up to the endpoint is less than unity. This is very 
likely to be the case when a proportion 1—z of catch trials is included, since then 
the total objective probability up to the endpoint is z «1. It follows that to 
outline a general approach to the question for all z it is desirable to assume that 
the subjective foreperiod range is larger than the actual range for all я, including 
7—1. This assumption will be made, although the main interest is in the 
behaviour of (2) in (0, h), the interval in which X can be presented. It is implied 
that S always underestimates unit probability. 

If it is assumed that RT(ż) is a linear function of 1/[p(t), then the RT curves 
for an ideal S and a uniform foreperiod distribution are straight lines. However, 
linear RT curves do not necessarily reflect that S is ideal. For, from the 
definition of p(t), 


K=- Sin 1-0) 


whence 


1-F()-exp[- | pds]. (5.1) 


Putting 


р(х) = = 6>0, 0<х<т, 


іп eqn. (5.1), it follows that 
1—F(t)=exp [£ In (y—2) + constant]. 
Using the condition Е(0)-0, 


£ 
1- F()- (1 " 3 „ Ukish: 62 
So that if 5% estimate of a uniform distribution belongs to the family (5.2), Ts 
RT curves would be indistinguishable from those of the ideal S. 

Тһе more familiar distributions such as the family (5.2), the normal, the 
gamma and the Weibull distributions, all yield monotone expectancy functions. 
It is possible, however, to obtain non-monotone expectancy functions by 
‘ mixing’ these distributions. Two such examples which exhibit one or both 0 
the features, i.e., p(t) decreasing for small t, and for t close to h, the endpoint 0 
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the range. ‘ Mixtures’ of the exponential distribution with (5.2) will be 
considered for £—1 and £ =2. 
Suppose that 


т 


J(= +r eT 0<і<?, 
5.3 
= ge M. 9 «t. pu 
ту + 75 = 1, and it is presumed that |y — | is small. 
'Then 
: тә — At 

рр ЕРНАР. 

т1(7—1) + 7e ^ 
-А 7<і. 


0<1<ч 


It can be seen that р'(0) > 0 if and only if (7А — 1) < mat, whilst p’(7) is always 
positive, so that this expectancy function can exhibit the first feature (compare 
Figure 3). 

Suppose now that 

2 -і 
f(t) = OD aterm «ren 
(5.4) 
= mÀ e n<t. 

Then 

nip’ (t)[1 — FO) 922(-1? miran? {A — 2) — 2\2—2]; so that 
р'(0)> 0 if and only if (71-2) « 22,71, whilst 5'(5) is always negative. 

To show that p’(t) is sometimes positive in (0, 1) if (qA—2)? 2 2747, it is 
sufficient to show that [A(zj —£) - 2]? —2 is sometimes negative in this range. 

[Mn —t) -2] - 2 = (t? - 2(34 2)t- 399—492 42. This expression is 
negative only between the values of ¢ given by 


22/2 
7-73 


and both values lie in (0, 7) since 
2 
didi YS eum 
25% 


This expectancy function can therefore exhibit both features if (Q) — 1? » 20,1 
and | = is sufficiently small. 

То study the effect of introducing catch trials we can examine the effect on 
p(t) of reducing (Л), which can be done by changing any one of the parameters 
тың or A. In Figure 4 the effect is shown of reducing 7, from 3/4 to 1/4 when 
А-12, 524, 1-33; F(h) is reduced from 0-93 to 0-85. It is seen that the 
effect is to produce a minimum RT at an interval less than the endpoint of the 
Similar results would hold for the discrete case if one were 


foreperiod range. i 
the geometric and regard ¢ in eqn. 


to replace the exponential distribution by 
(5.2) as a discrete variable. 
B 


S.P. 
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One remaining question concerns the interpretation of a mixed distribution 
such as (5.3) or (5.4) when the experimenter uses a uniform foreperiod distri- 
bution. One can suppose that the continuous treatment given here is an 
approximation to the following discrete one. Suppose that the time axis is 
quantized into small intervals of length А. Іп the ith interval S estimates the 
unconditional probability, f(éA)A, that X is presented in that interval, and the 


(zl 
cumulative sum of his estimates up to the ith interval, i.c. F(iA) = X f(rA)A. 


к=! 
His SR is based on the ratio Af(iA)/[1 —F(i^)]. One can suppose further that 
the estimates f(/^) are independent of each other, and result from S’s being in 
one of two states. With probability ху, 5 is in the first or ‘ attentive ’ state and 
he behaves like the ideal S, i.e. his subjective distribution is one of (5.2). With 
probability т he is in the other ‘ inattentive ’ state, and in this state his estimate 
comes from an exponential distribution. With this interpretation the introduc- 
tion of catch trials should increase the probability of his being in the second state, 
Le. reduce ті, while leaving unchanged the parameters of the component 
distributions. This is the state of affairs illustrated in Figure 4. On the other 


hand, one can regard a mixed distribution merely as an approximation to the 


20 


T5 


1.0 


0:7 


0 1 2 3 


FIGURE 4. Effect of * catch’ trials. P 
- Plot of 1/p(t) for 
К) - 2a (a—2) fa? + ту e- M, ee i 
= TA e, axt. i 


71—3/4, 1/2, 1/4; а=4; \=1/2. 
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actual subjective distribution. In this case the search should continue for better 
approximations to the subjective distribution which also give non-monotonic 
p(t). It may be as well to remark that a * mixture ' of the two gamma distri- 
butions, \,2¢ exp ( — А?) and А, exp ( — Ast), can exhibit either of the two features 
depending on the value of А,/А but because of the absence of data on subjective 
distributions this question will not be pursued further here. 


6. CENTRAL INTERMITTENCY 
6.1. The psychological refractory period 

In the serial reaction task 5 is told to respond to two signals at each trial. 
The first, Жу, is presented at time ¢ after a warning signal, and the second, >, 
after a time t-+J; I is the inter-signal interval and ¢ and J are variable. The 
statistic of primary interest here is RT), the RT to, It has been found that 
RT) is minimal when J is large, that this minimum is maintained as J is reduced 
down to some value, Zz, and that for T < Is, RT) is greater than the minimum 
(Craik, 1948; Hick, 1948; Vince, 1948). The period of length І, is called the 
Psychological Refractory Period. It is the period following the presentation of 
Y, during which a second signal would give rise to a second RT longer than the 
minimum. ‘The lengthening of RT) above the minimum when І<І is 
called the delay and is generally interpreted by workers in this field as the 
© storage ? time, or the interval between the presentation of >, and the initiation 
of the second response. 

Evidence has been adduced to show that the delay occurs in the central 
nervous system and the first theory about its magnitude was put forward by 
Hick (1948) and Welford (1952, 1959). According to Welford, delays are due 
" to the central processes concerned with two separate stimuli not being able 
to coexist, so that the data from a stimulus which arrives while the central 
mechanisms are dealing with data from a previous stimulus have to be ' held 
in store’ until the mechanisms have been cleared " (Welford, 1952, p. 3). This 
statement is also referred to as the intermittency hypothesis. So that if RT) 
is the actual first reaction time and 87), the minimum second reaction time, 


then, by Welford, 
РТ) = ВТ + RT) — I. 

This formula cannot explain delays which occur when J> ТӘ), Welford 
has explained such delays by supposing that before S attends to E, he has to 
wait for a feedback signal indicating that R, is being made. Davis (1957) has 
explained similar delays by supposing that there is a ‘ Central Refractory Period ° 
after the central processes have dealt with X, and during which they remain 
unoccupied. 

However, in other experiments (Elithorn and Lawrence, 1955; Halliday 
et al., 1960; Borger, 1963) the delays obtained were much smaller than either 
the Welford theory or the Davis refinement would predict. Elithorn and 
Lawrence attempted to account for the delays by proposing an ‘ expectancy ' 
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theory in which RT was a function of SR. Their approach, though ууа 
shortcomings (see Davis, Hick and Welford, 1956), will be found to be simile 
to the one to be adopted here. | | ҚИ "T 

Тһе present contention is that temporal uncertainty of signal influences 4. 
for R, the uncertainty is a function of t, and for R, it is a function of ТА Tit ы 
X, provides the warning signal for Х,. Іп most of the work on intermittency 
the minimum AT is referred to as the normal RT, and any increase over this 
minimum is called the delay. In the present study the normal RT will be taken 
to be the RT to a signal when no overt response is required to the warning signal. 
Thus, in the present context, the normal second reaction time, RT (ог 
simply RT), is the RT to X, when S is not required to respond to Y. a 
avoid confusion, the term /ag will be introduced to refer to the increase m R 
over RT, when S has to respond to Х,. The lag in the second RT is (RT?) — 
RT.) and measures the response interaction when both signals require a PEDES 
It is zero, by definition, either when no response to Y, is required or when 
the inter-signal interval, is greater than RT). Positive lags have been obtained 
by Davis (1959) and Kay and Weiss (1961), who performed experiments 1n 
which no response to 2, was required, and in which 5 responded to both signals. 

In sum, it is supposed that S attends to a response as soon as the appropriate 
signal is presented and that his RT depends on 

(a) his SR at the time of presentation of E, and 

(b) whether or not he is makin 

Some of the theory developed 
the lags from the data of Davis 


g another response at the same time. " 
in Section 4 will now be applied to calculat 
(1956) and Halliday et al. (1960). 


6.2. Calculations using simple and disjunctive RTs 


Before lags can be estimated from observed RT) in the serial reaction task, 
it is necessary to obtain estimates of the normal RT я 
conditions. Let us suppose that X, сап be presented at one of the time-points 
t—1,2,...,n (the clock-time between the time of presentation of X, and one 
of these time-points is simply the inter-stimulus interval, Г), and let RT; denote 


t 
the normal RT to У, when it is presented at the ith point. We assume tha 
RT; is given by 


4 f menta 
under the given experiment 


RT,—a4 bp 0c pi bs 


= КТ, Dt pa, 
where p; is the expectancy function and can b 
the experimental procedure. 


(6.2.1) 


of 

e calculated from a knowledge к: 
: The constants a, b and ЕТ» сап be determin ей 

by fitting eqn. (6.2.1) for values of ¢ corresponding to values of J> RT, sin 


for these values of I the lag is, by definition, zero and the observed ВТ) is зи 
normal RT to Z}. The fitted curve can then be extrapolated to give estimat 
of RT; for all 1. 


B Р . . " 
T'wo remarks will be made concerning the extrapolation of АТ curve 
First, it 


was stated in Section 3 that there may be an upper limit, pz, beyon 
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which greater conditional probability of signal occurrence does not affect RT. 
So that fitted RT curves should be used as prediction formulae for large p: only 
when RT, is known. Since RT, is probably in the region 0-15 —0-18 sec, it 
will be assumed that if a 4-5» 0-18 sec then extrapolation of fitted RT curves 
is valid for all р. Second, in the experiments of Halliday eż al., X was equally 
likely to be one of two signals. It seems probable that signal detection takes 
longer in a disjunctive situation than in a simple one, so it will be supposed that 
the effect of having two or more equally likely signals is to increase a and RT; 
in eqn. (6.2.1), while leaving 6 unchanged. It would follow that if, for a given 5, 
a and b are estimated from disjunctive RTs and then used to estimate normal 
RT in a simple situation, the values of normal RT so obtained would probably 
be overestimates. 

In the experiments of Davis (1956) a proportion 12/64 of catch trials was 
included, so that f; can be calculated by using the theory of Section 4.3 together 
with knowledge of the experimental procedure. I (secs) was chosen from a 
16-point uniform frequency distribution, the points lying between 0-05 and 
lsec. For both Ss examined, RT; had a minimum at about t=12, so using 
eqn, (4.3.4) and putting 7=52/64, n —16 and t= 12 in eqn. (4.3.2), the estimate 
of d is 0-028. Also, the estimated value of ż (< 12) for which RT, should be 
approximately equal to ЁТ is between 5 and 6. This last ‘ prediction’ is 
borne out by the curves for both Ss. "Тһе constants in (6.2.1) were estimated 
pn the method of least squares and the lags, (RT€) — АТ), are shown in 

"able 1. 


Taste 1. Lacs mw RT) (ім Миллѕесѕ) ғов DIFFERENT VALUES OF 7. RT ABOUT 
0-23 sec. Dara OnramED FROM Davis (1956) 


1 (secs) 0-05 0:10 0:15 0:20 
Subjects 
S, 117 69 37 23 
5% 110 25 15 7 


T In per experiments of Halliday ег al., E, could have appeared on either the 
eft- or right-hand side of a screen and X, appeared on the side opposite to Хі. 
At each trial signals could have been presented at times 0-00, 0-05 or 0-10 sec 
(t= 1, 2, 3) after the end of a warning signal. Both were presented with proba- 
bility 3/4, otherwise only one was presented with the restriction that signals 
were not presented simultaneously. The experimental probabilities of signal 
occurrence at the three time-points are shown in Table 2 (а). Though catch 
trials were included, the possible change in subjective belief was likely to be 
negligible since there were only three possible time-points. Using the proba- 
bilities in Table 2 (а) the eqn. (6.2.1) was fitted by plotting АТ against 
probability for the disjunctive single situation only. 

One of the conclusions of Halliday et al. was that for subject II, RT) 
was ‘ speeded up’ if X, was a left-hand signal. Values of b and a +b are shown 
in Table 2 (b); also reproduced there for this subject is АТО), the observed RT 


Ressarch 


Bureau of Egel h Fsyl. 
(5 СЕК! ) 


Date .,.....”е“”“ 
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TABLE 2. DATA OBTAINED FROM HALLIDAY ET AL. (1960) 


i i i i al is the c itional 
(a) For any ' time’ column the element in the leading diagonal is the conditio 


probability for Xj, and those off the diagonal are the probabilities for Xa 
‘Times “0” 202 “100” 
7/24 3/7 3/4 
2/5 3/4 
1/2 
(b) Estimating a and b by fitting the curve RT —a--b р. 
Right hand Left hand ыры 
Period b a+b b a+b RT 
1 1:6 239 23-9 227 
S, 2 192 229 17-3 228 
3 134 225 13-1 230 
Меап 231 Mean 228 
1 (0)* 217 26:3 192 202 
2 0 233 29-3 214 205 
3 (0)* 219 214 212 212 
Sy 4 2:9 230 15-0 222 237 
5 65 218 168 225 227 
6 (0)* 233 39 235 230 
7 6:7 223 12-7 214 211 
Mean 222 Mean 216 


* Signifies that the estimated value of b is negative, 


to the left-hand signal when it came second after an interval of 0-10 вес. А 
comparison of these values with those of a+b suggests that the small values ds 
ЕТФ can be explained in terms of subject II’s expectancy. It would still be 
an open question as to why there seems to be no interaction of responses when 
У, is a left-hand signal. The lags are small and are not shown. 

Two seemingly interesting practice effects can be seen in Table 2 (b). 
It appears that in both Ss a+b is not appreciably 


k я at 
affected by practice and tha 
it is about the same for both hands. 


However, in the unpractised (left) hand 
of both Ss there is an approximately linear reduction in b with practice. 018 
the index of the expectancy effect, and the fact that it is different for both hands 
Suggests that SR may be related to muscular tension. One might conclude 
from these two findings that there is an upper limit to the rate of responding 
which is resistant to training, but S improves his average performance Wit 
practice by learning to react more quickly to unexpected stimuli. 


7. THE INTERACTION оғ TWO RESPONSES 

7.1. Statement and discussion of a inodel at 
Тһе positive lags referred to above will be interpreted as evidence me 
there is some form of response interaction when the making of two response 
overlapsintime. The original statement of the intermittency hypothesis implies 
the extreme assumption that 5 attends to one response until it is made and Qe 
attends to the other. This assumption as it i 


! stands is incompatible with a 
expectancy approach outlined above because t 


he values of the lags obtained 4 
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significantly less than (RT) —7). However, intermittent behaviour of some 
sort seems to be a simple explanation of response interaction. 

If X, is presented before R, is completed, it can be supposed that S may 
be able to attend to R, for some of the time up to the completion of Ry. This 
could be the case, for instance, if a response consists of a certain number of 
stages, some of which are also stages of the other response. Thus when there 
are two responses to be made, there would be three types of stages, one type 
comprising stages common to both responses, and the other two comprising 
those stages belonging to one response and not the other; with these stages 
being performed serially. Given these three types of stages, response interaction 
could then be described in terms of the process determining the order in which 
these stages are performed. This process should take account of at least two 
features. The first is the possibility that ‘preference ' is given to the preparation 
of the stages of R,, because this response was the first one started. The second 
is the possibility that a stage of a given response is likely to be followed by a 
stage of the same response because of some form of association among stages 
of the same response. ‘The relevant property of a stage Is its type, so 1t 1s 
3-state process giving the sequence of types. 
One such process is the 3-state Markov chain in which states 0,1,2 correspond 
to the types of stages belonging to both R, and Re, to R, and not R> s Ry p^ 
not R, respectively. If the transition matrix 1s denoted by (qu), a за der 
probability of going from type i to ty 2, then ' pretere: 

° association? would be reflected by the ine 


necessary to consider only the 


pe j, i J =0, 1, 
qualities 


Qo 2 Чо? 
quot Iu 2 h° 


Qao + 4222 421° 


considered where qoj = dij 9) for 


Іп what follows, the simpler case will be 
J50, 3,2: 

The organization of a response will 
(thought of as being large), of stages whicl 


i random time 5. Because of preparation, 1 their execution starts әгі. 
hi (< у) stages of R, remain to be performed and the 


s in ee P ie ga and, tbc, of tbe tpn охла а 
2, is presented at /-- and it is supposed that by that = is roe н 
1 remain to be performed, and that 2, (<o) stages OF ^ emery 
Preparation). The random process which starts at time #+2 1 pp etit 
the following. Starting from the instant that the duo Des дзе je 
complete, the next stage performed belongs to R, (but not R.) W. ^ Probability 
Y, to К, (but not R,) with probability Y, or to both R, and Аз wit oe meets 
c У,). The process continues until there are no more stages 0 ridi 
left, from which time the stages belonging to the other response are P 
Serially, 


ded as having a number, 7 
erially, each requiring 


5 iÍ ime 4 
when Ху 18 presented at time f, 


be regar 
h are performed s 
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The RT data available would allow the moments of ón; and the values 5 
Ү,і-1,2ю be calculated. If the Y; are allowed to vary among Ss, the mode 
can be made to apply to three types of experimental results: - 

(i) No lags in either РТО) ог RT) It could be supposed here ben: 
Y,=Y.=0. In this case the overlapping portions of R, and К, are p 
together or grouped. Vince (1948) has stated that grouping might occur w Tis 
I is small because S perceives and responds to both signals together. 'l his 
would lead one to expect simultaneous completion of R, and Ry. The pue 
meaning of grouping is somewhat different, and one should not expect R, anc 
К, to be completed simultaneously since л; would probably be less than mo. ҒА 

(i) Lags in RT?) but not in RT. It could now be supposed oe 
Y,—0. If R, is completed before R, and when R, is completed zn e = 
R, (but not R;) and (n, —m,) stages of both R, and R, have been prepared, үе 
тү is a binomial random variable with parameters Y, and лу. Writing 1- 
E[8(n,' —n,)], it can be seen that 


RT = КТ + Y (RTO — I), 


where RT) is the normal RT to X, and RT) is the observed RT to У. 
(iii) Lags in both RT® and RT, 
and Y,-0 and Y, Y,« 1. 


; š š — s been 
Іп the last two cases intermittency couched in probabilistic terms has bee 
introduced. 


Here it would be supposed that Yi 


7.2. Approximate distribution of RT 
Suppose that 8 has the density function 
f(s) - B e. 
Let L, and L, denote the lags in RT and RTC) respectively, ‘Then 
2-ҚАЕТ,Ф)-1, and var (RT 0)=0} = 5 ™ 
B F: N х В p (7.2.1) 
Ny А 2 Ng 
z "E(RT.O), and var (RT Oa, = 2. 
pom PUn w^ д 
at 
ge of R, (but not R,) is performed ir 
2(4)-1— Y, If the probability that R, is completed before Ry is аа 
the number, ә” of Stages performed up to the completion of R, has angers a 
the same distribution as the waiting time, v, up to the 2,th occurrence of A wh 
ny is infinite. The probability generating function (p.g.f.) for v is 
1— Ye 
P(z)= (1-¥,)2 
(ә) | yz]. 
the approximate Characteristic function (с.Ғ.) for RT,—I is 


C(0) = Р[ф(8)], 


Let A denote the event that a sta 
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where ф(8) = B(B + 0). 
B(1— Y.) 71" 
e C(0)=| ——— | › 7 
t- [ven pen 
which, not surprisingly, is the c.f. for the gamma distribution. From (7.2.1) 
and (7.2.2) it follows that 
Y, 
E(L,) = iY. [E(RT 9) - I]. (7.2.3) 
Now let m, be the number of stages belonging 
by the time R, is completed. If ag is infinite, 7; 
Q()- [1 - Y) + Nis)". 
negative exponential random variables. 
], from which is obtained 


to R, (but not R.) performed 
has the binomial p.g.f. 


It is clear that Lg is the sum of zn 
Therefore, the approximate c.f. for Ly is (6/20) 
E(L;) Y E(RT,) -1], 


n (7.24) 
Y z 
var (L,) = -Fa 
Since var (АТ) = 2, + var (La), then, from eqns. (7.2.1) and (7.2.4), the 
following inequality is obtained: 
(7.2.5) 


оз, < (RTO) < 2x + V2 — OLET 
"То find the covariance between RT, and RT, first define RTO” -RTO-I. 


Given that v=j, 

RTG = ВТ +the sum of m -(j-7m) nega 
variables which are independent of RTO. 

E(RTO'RTO|o-j) = ЕКТ] 


tive exponential random 
)+ BART е j) E (n, +") 
- wll 4 ng Үш). 
Tur Р 
E(RTO'RTO)) = а png Ут). 


Noting that B(RT®) = ftra Bem] 
zo (uc Үт), 
(RT y”) it is found that 
=cov (RT, RT®) 
E(RTO)-I „БА (7.2.6) 


= RTO) 


from the data using eqn 


and approximating B ! by оз [E 
cov (RTO, RT?) 


з. (7.2.3) and (7.2.4), 


n be devised. Тһе first is stated in 
s satisfied in most of the 


ht-hand inequality is not. 


Үү and Y, can be estimated 
and certain crude checks of the model ca devi 
the inequality (7.2.5). ‘The left-hand inequality is 
experiments of Halliday et al. and Borger, but the rig 
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Also in the data of Borger, L, is sometimes greater than (RTO -1) which Т 
inconsistent with (7.2.4). However, because of the large variability prd 
present in RT data, it is felt that these inconsistencies do not necessarily reflec 
the inadequacy of the model. This could perhaps be better assessed by com- 
paring empirical and theoretical distribution functions. 


8. CONCLUDING REMARKS 


The first concern of the present study has been to treat mathematically the 
influence of stimulus probability on АТ, and a prerequisite of the approach 1s 
the existence of a subjective correlate of temporal uncertainty of the signal. 
The suggestion has already been made (Audley, 1963) that the uc dn 
measure of temporal uncertainty is the conditional probability that a Pd 
would be presented given that it has not yet been presented. ‘This conditions: 
probability has been called here the expectancy function and it has been the 
variable most exploited in the analysis. ‘I'he experiments of Walter et al. (1964) 
and Sutton et al. (1965) Suggest that a subjective correlate of signal uncertainty, 
SR, can be found in the cortical activity of the subject. It is clear, however, 
that there is a need for more empirical work designed to determine both the 


2 : : 1 
nature of SR and its relation to the expectancy function, on the one hand, anc 
RT on the other. 


In Sections 4 and 5 ways have been ex 
aspects such as time and probability estim 
curves. Іп the approach, 
situations in which the fore 


amined of incorporating subjective 
ation into a theoretical study of RT 
use has been made of a distinction between those 
period distribution is discrete and those in which the 
distribution is continuous. In the former situations the points of the distribution 
are points at which SR reaches local maxima, and it follows from this that errors 
in time estimation lead to reduced preparedness. Оп the other hand, lower Do 
paredness cannot be the result of errors in probability estimation alone unless 
the subjective probability estimate has a negative bias. If, however, the fore- 
period distribution is continuous wit 
points at which SR should be 
idea of time estimation seems 
range has to be estimated, but 
the experimental pro 
estimate at /, and it i 
It is not argued that 
psychologically, but 


h finite range, there are no preferred pot 
maximized and within the foreperiod range rie 
irrelevant. "о be sure, the beginning of e 
after then the subjective transformation, m 
bability density function, /%(4), is essentially a probabi ui 
5 this transformation which, by hypothesis, determines 4 i 
discrete and continuous distributions arc necessarily sae 
only that if they are, then this distinction is important 
the theorist, because his assumptions about SR are different in the two cases 24 
is the relative significance of time and probability estimation. ve 
Two ways of treating the case of continuous foreperiod distributions x ts 
been discussed. One approach (Section 4.4) has been to assume that S adjus 
his SR intermittently, as in the discrete case, and that his SR remains consta? 
between the times of adjustment. It is shown that the existing empirical 
curves would, on the basis of this model, imply that S's rate of adjusting 
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decreases with time. The other approach, as outlined in Section 5, is analogous 
to the approach used for the discrete case. It assumes that RT() is a Lan 
function of p(t)-!, where p(t) is the continuous expectancy function. ж 
. I'he other topic for which two ‘ competing ? treatments have been offered 
is that of catch trials. In section 4.3 time-estimation errors were neglected and 
it was assumed that probability estimation is affected by changing subjective 
belief in the hypothesis that a trial is a catch trial. In the other treatment, 


mentioned in Section 5, it was assumed that probability estimation is the result of 

У 2a , " x М x B PE n 4 М 

a ‘mixing’ of the objective foreperiod distribution and another distribution, 
ertain parameters of this 


and that the introduction of catch trials changes c 
mixed’ distribution. The assumptions made about subjective belief and 
about ‘ mixed ' distributions have, at present, no empirical justification, but it is 
felt that the theoretical analysis may have some value if only because it demon- 
strates the possibility of accounting for certain experimental facts. 

Studies of reactions to discrete pairs of signals arose out of attempts to 
analyse human limitations in continuous performance tasks. Because of this, 


it is perhaps not surprising that the intermittency hypothesis is a statement about 
Thus the first signal, X, is 


the limitations of the central nervous system. 
regarded as highly ' compelling ' from S's point of view, having a distracting 
effect and requiring some sort of discrimination whether or not a response to it 
is required. The central mechanisms would be ‘captured’ by Ул, and this 
would give rise to a delay in RT since no two central organizing times can 
overlap (Davis, 1959). In the expectancy approach adopted here, X, is ' com- 
pelling ' only in so far as it provides the last warning signal after which S can 
expect Xs, and it is not assumed that the central mechanisms are necessarily 
limited to dealing with one signal at a time. So the intermittency studies 
differ from the RT-foreperiod studies only because a response to X, is required, 
and to sce if this difference affects RT the difference between the RT to 2, 
in the two situations can be measured. ‘This difference, which has been called 
the lag, then measures the degree of interaction between the making of two 
responses. ‘The model for response tion which has been proposed is 
based on the idea that the central mechanisms while attending to one response 
can, for some of the time, attend to another response as well. This idea is in а 
Sense a probabilistic analogue of the intermittency hypothesis, and by varying 
the probabilities involved, the model can account for different types of reactions. 
This is another example in which a theoretical model is useful in accounting for 


qualitative features of existing RT data, and this has been the main aim of this 


Paper, 


interac 
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A DISCRETE PREDICTING CONTROLLER 


Ву LISANNE BAINBRIDGE 


De В š š та 
epartment of Industrial Engineering, University of California, Berkeley * 


Ж wee bates — an attempt to develop a control device which has some 
oe ы “. nasi processing rather than of the lead-lag formulations 
"her tne гу. һе controller 8 sensory limitations, the way in which it in- 
сент. p Жа behaviour of the system under control, and its control 
mic a enc ed. <A digital computer simulation of the controller has been 

jected to two performance tests; and the limitations of its behaviour are dis- 


cussed, 


1. INTRODUCTION 


ime models of the human operator controlling a machine or 
Ser, Ebr ing task have previously been. made in the language of control 
с ial st h is convenient when the model is used to represent the operator in 
ы ramos of man-machine relations as all the parts can be described in 
i s dg implications. А control situation 15 shown in Figure 1. The 
i hic i E C, ‘the system output value, to the required value A. The 
difference = ез a corrective change of the system input M according to the 
eiiis I een C and A. А control theory model of the controller is an 
седі relating the controller input (C — A) to its output M, and is obtained 

ytically from records of the controller's behaviour. А useful review of this 


approach is given by Sheridan (1962). 


Controller 


Identification of symbols in a car-driving example: 


Fic 
URE 1, Controller-system relations. 
required position on road 


A target value 

М manipulated variable steering-wheel angle 
System car-steering sub-system 
c controlled variable actual position on road. 


ays in which automatic and human con- 
vidence. Figure 2 gives an industrial 
mill (Beishon, 1966, 1967). Itis 


-— Table 1 compares some of the w 
ers differ, as shown by experimental e 


exa ravi 
èle mple of human control activity in a paper 
ar that although an automatic control model can behave in much the same way 


as : ; 

fe a human controller the underlying mechanisms are not the same. This paper 

m an attempt to synthesize à controller, called FRED, with the human 
aracteristics given in Table 1. 


1 Now at the Department of Psychology, University of Bristol. 


TABLE 1. 


PROPERTY 


controlled variable 


rate of change of con- 
trolled variable 


timing of measures 


manipulated variable 


timing of corrective 
actions 


control type 


size of corrective 
action obtained from 


FRED can control an 
order system in response 
will be described later, co 
а system at steady state 
time) to a step change i 
A higher-order system 


a human controller can m 


Lisanne Bainbridge 
COMPARISON OF AUTOMATIC AND HUMAN CONTROLLERS 


AUTOMATIC CONTROLLER HUMAN CONTROLLER 


Input Variable Measurement 


i res isplay 
any value on continuum value to nearest : dam 
scale unit (e.g. McCormick, 

1964) 


any value on continuum if rate variable not displayed 
| controller makes absolute 
judgements limited to 742 
values (Miller, 1956) 


continuous, or sampled at equal 
time intervals (sampled data con- 
trollers) 


distribution of sampling , 
rates rather than single rate 
(Senders, 1964) 


Response Setting 


i i iti ich are 
any value on continuum discrete positions which x 
closer near target Và 


(Table 6) 


continuous, or at regular intervals intermittent (Figure 2) 


Controller Operation 
feed-back, responds to present 


feed-back (Figure 2) 
error 


lic- 
feed-forward, uses pre е 
tions in selecting соггес 
action (Poulton, 1957) 


solution of controller equation ? 


chooses values so that p 
trolled variable will = 
overshoot target (Beisho™ 
1966 a) 


y linear slow response over-damped second or high 
to a single-step change of А. The simulation, ИЕ, 
ntained a second-order system. The response at pet 
(i.e. when the output value has not changed for 80 
n its input M is given by the dotted lines in Figure 
has the same type of behaviour. If FRED is a use 


ha 
apable of controlling the more complex systems | 
апаре. 


ful 
t 
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FIGURE 2 
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to 
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FRED reads the present system output value Cp from the interface, 
compares this with A and makes an absolute judgement of C’, the rate of change 
of C. It then predicts the effect of various M settings on C and uses these 
predictions to choose a sequence of Л/ values, and their timing, which will bring 
C to rest at 4 in minimum time. Each member of the sequence has a predicted 
effect on the system which is used as the basis for choosing the next value. 
As the control sequence allows for predicted behaviour this is feedforward 
control. When FRED is about to make the next action of the sequence it checks 
that the system is at the predicted value; if it is, FRED continues, otherwise it 
chooses a new sequence of M values for control from this Ср. ‘This is feedback 
control as FRED checks the correctness of its actions. 

The basic sequence of M values (КІ, R2, R3 and R4) is shown in Figure 4. 
Тһе top drawing shows the pattern of values chosen when Ср is stationary ог 
moving towards 4. R2 moves the system output towards A, R3 brings it to a 
stop at A, and R4 maintains it at this value. When Cp is moving away from А a 
first value R1 brings C to a halt at C}; and from this position R2 is used. 


R2 onward value 
| 
Ë | | 
= 
— + 
р ЁТ? ' RT3 ' RR | t 
Original sample Feedback samples 
Mp 


^1 turn around value 
n 


Original An RT2 , t 
dianat pai 2, RT3 RT4 t 


Feedback samples 


FIGURE 4. Basic controller strategy. 
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2. CONTROLLER SENSITIVITY FOR INTERNAL VARIABLES 
idis e System output (C), and time (f), which are continuous variables, are 
: ; : : 
isplayed ' to FRED it can only obtain their values to the nearest interface unit 


C' is not displayed, and FRED obtains its value by comparing C values at 
One of FRED's basic features is that it can use only a limited 


number of characteristic values for a variable in its internal operations. The 
rate value is obtained by internal operations; this is equivalent to the bares 
process of absolute judgement, so the controller is only allowed to distinguish 
between seven different C’ values at any one time—three increasing rates, one 
stationary and three decreasing rates. Two factors determine the actual values 
which can be discriminated: 

Ratio Rate discrimination in one direct 
AC'|C' =w. A series of physical magnitu 
is given by 


successive times. 


ion is limited by a Weber fraction, 
des which are discriminably different 


п-0, 1, 2. 

18 be distinguished аге 1, 2, 4, 
lled ‘slow’ by FRED, the 
value. FRED uses 
d on human Weber 


C, = Cy (0 +)", 
=1, actual values which cou 
4, 8, 16, where the first is ca 
he third ‘fast’, whatever its actual 
Work is neede 


For instance, with w 
or 2, 4, 8, or 3, 6, 12, or 
second ‘medium’ and t 
w=}, a value which was chosen arbitrarily. 
fractions in this context. 

Adaptation level, AL As can be 5 
ratio basis for discrimination is that the same physi 


category in different discriminations. Whether an 
у depends on the adaptation 


called fast, medium or slov ; 
which FRED selects according to the range of C’ values ге 


Figure 5). 


result of using a 
cal value can fall in a different 
an actual rate of 4 units is 
level of the controller, 
levant at the time (see 


een from the example, a 


/ characteristic values 


limit on fast can be used 


ALz x/á 


AL=x/2 


discrete 
categories 


AL=x | 


medium 


' 
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The physical values which can be discriminated at a given AL are = 
characteristic values used at this time by the controller, and these mark t e 
division of the continuous variable into sections. All actual values in a given 
section are discriminated as having the characteristic value, which is known by 
the controller. The AL is arbitrarily specified as the medium rate. 


Nesting The effect of the adaptation level is to alter the position of sections ме 
the continuous variable, so changing absolute but not relative accuracy. Figure 
5 shows three AL’s which are related in such a way that they give a nested 
succession of discriminable categories. ‘his nesting is made possible by the 
ratio mechanism. "Го remain іп this succession FRED cannot change adaptation 
level continuously but only to discrete values that are themselves discriminable. 
The joint effect of ratio discrimination and AL is that although a limited number 
of discrete categories are available for discrimination they can be used to cover 
the entire variable range or to allow fine discrimination, and using discrete 
adaptation levels maintains the coherence of these changes. 


3. FnED's REPRESENTATION OF FUTURE SYSTEM BEHAVIOUR 
FRED chooses control actions by predicting the effects of possible actions. 
То make these predictions it must contain some standard templates of the effect 
of control actions on system output. As there is no experimental evidence on 


human templates, FRED's were devised to describe system behaviour given the 
above limitations on internal accuracy. 


SIMPLE TRAJECTORIES 
The response of the system at steady 


already been given in Figure 3. For simplicity, at the steady state C value for а 
given M, C, = M,; in real control tasks the transformation from C scale to M 
scale is one of the sources of task difficulty. When M is changed to another 


value М1, C changes until it is at the value corresponding to AM, С,= Mr 


Whatever the distance between C, and С,, the size of the trajectory, C takes 


the same time to reach its new value, during which the С” changes follow а 
standard pattern of relative values with tin 


пе, although the actual values depend 
on the trajectory size. 

As FRED divides the rate 

ratio we can use this t 
Figure 3). Fora given size of simple trajectory the rates will range from J e 
some С” ах he upper limit of the fast category, t is 
the ratio; the rate will be slow (в101) for a we 
(med2), fast, medium (med4), slow (5105). es 
ategory will be the same whatever the size of * 
S ^4 5 will be called the time (t) template. fot 
determining C’max determines the AL used is 
is change. ‘Trajectory size is C’max/k, where 
Tate constant and AL, — V(k x size). 


state to a step change in input һа5 


the system fast- 
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m. кыы template for the proportion of the trajectory passed through 

не 2 of the ¢ template сап be obtained from the template and rate 

zd ts. is is called the C template, CC. Тһе central value of C in each 
ion is used as the characteristic value for C in that section. 


К. ; 
{= rap tọ (i п)-(1, 2), (2, 1), (3, 0), (4 1), (5, 2). 
and 

5 

* CL 


assumes that FRED discriminates C and t 
y to the nearest interface unit. Part 
will be passed while C traverses the 
s become smaller med2 and 4 will also 
before a change in C can be detected 
the above method gives no way of 


Unfortunately this representation 
Perfectly, while it actually knows them onl 
of the slow sections of the time template 
first or final unit of a change; as trajectorie 
be affected, and so оп. Тһе? which passes 
Will obviously vary with trajectory size; 
determining this time but does give a way of identifying the trajectory size at 
which different sections begin to be affected. The minimum trajectory х which 
can be represented correctly, in which C traverses the C display unit (1) during 


^ is given by 1/CC,. 


Cy Mo 


Стах | 
for Cp 
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TRAJECTORIES AFTER A STEP CHANGE IN M WHILE С IS CHANGING 
There are three situations to be considered: 


(1) Addition—the М change to a new value M, and movement of C are in the 
same direction (see Figure 6); 


(2) M is moved in the opposite direction to С; 
(а) Subtraction — Ср is less than the steady-state value equivalent to Me 
(see Figure 7); 
(b) Turn around — Ср is 
(see Figure 8). 
. Addition When M, is further from Cp 


past the steady-state value equivalent to Me 


than M, the rate pattern followed by C 
after the change is the same as the rate pattern for a simple control change of 
size Cp to С-М,, this size determines the maximum rate. The new rate 
pattern continues from Cy’ (see Figure 6). 

The two trajectories will be discriminable only if the characteristic value of 


the rate category of Ср” in the first is the same as that of its rate category in the 
second, although the 


‹ categories have different names, as only values in ratio 10 
the characteristic values of the first trajectory can be discriminated. Cp’, which 
may be in any category of the present trajectory, can enter three different next 


trajectories, as 5101, med2, or fast, so Cy’ can be a member of the three adaptation 
levels 


Cy (1 +z)" 

with n= 1,0, —1 respectively. The largest of these trajectories is the one which 
Су Joins in the slow section, i.e. Cy'(1+w)2/k. Also, as FRED cannot dis- 
criminate between parts of a trajectory falling in one section (see Figure 3) it will 


ж. жана 
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not be able to predict the timing of future behaviour with any accuracy unless 
the control change is made at a discriminable point in the new trajectory, the 
Junction of two ¢ template sections. While choosing a control sequence FRED 
is restricted to considering trajectories which it can represent clearly. 
Subtraction The effect of reducing the distance between Cp and C = M depends 
on whether C’max for the second trajectory has already been passed in the first. 
If it has not, system behaviour after the change is as for addition (see Figure 7 (i)). 
If this С” шах has been passed, the new trajectory is joined during deceleration 
instead of acceleration. The deceleration curve has a standard shape, de- 
celeration from Ср to C2 M is the same for a given Ср to C=M distance 
whatever the original trajectory size. This means that when Ср to Cz M is 
reduced there is a short period of rapid deceleration until the correct C' for the 
new Ср to Cz M is attained and from this value the curve follows the standard 
form (see Figure 7 (ii). Arbitrarily, deceleration time 18 taken as one t rs 
and the length of C traversed during this period as the average of Cp and rs 
target rate over the period. Again FRED can only reduce Min — v 
values which will give, at the end of rapid deceleration, ac ens tha TA 
distinguished from Cy’, i.e. Ср (1 +)” for n=1, 2, being the medium ап 


rates of the trajectory in which Ср = C'max: 


Turn Around If C has passed C=My ue s 
trajectory follows the simple form from the turn around point Ст, 


at which C’=0, to M, (sec Figure 8). 
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The turn around time, and the distance from Cp to C. 


“р are proportional to 
Cp to C=M,/C=M, to Cy. А single value combining both effects is used for 


each of the ¢ and C turn around parameters, TK and CK. These ¢ and C values 
are obviously limited in accuracy to the ¢ and C interface units. As this is an 
internal manipulation it should be even more limited in accuracy, but this will 
not be investigated here. M, values with discriminably different effects could 
be used in a more elaborate version. 


4. THE STRATEGY USED By THE CONTROLLER 
FRED's control strategy has already been described and is shown in Figure 
4 and Tables 2 and 3. Some details of this strategy 
Conservatism R1 and R3 аге values used to 
end of scale values, but FRED is more cons 
the distance C traverses before С” = 


should be mentioned. 


bring C to a stop. They could be 
ervative and chooses values so that 


0 is greater than a minimum size. Experi- 
mental evidence about human behaviour in this situation is needed. T'he limits 


Were chosen arbitrarily: R1 so that (Ст to A) is greater than 2|2; R3 so that 
C is stopped from a distance greater than 2/5. 


Choice of R2 R2 and R3 are found by predicting the effect of an M (R2) setting 
in bringing C towards А, and then finding an M (R3) value which will stop C 
without overshooting A, basing this prediction on the characteristic value of C in 
the R2 trajectory section which contains A. If no R3 value does this, C values 
in sections further from A, up to the one containing Cp, are tested in turn until a 
section associated with an acceptable R3 value is found. R2 is used during 
control for R'T2, the time C takes to Dass from its present section in the R2 
trajectory to the section in which R3 is imposed, which is calculated using the t 
template. 
There would be many M settings for which an R3 v 
is required which will bring C to A in the sh 
therefore first selects a strategy for suggesting M values whose effects will be 
tested. Ав there is no evidence about human strategies, FRED's are based on 
the possibilities allowed by the system representations. These are either to 
increase C’ using the addition me i 


alue could be found, 
ortest time. FRED 


which also ignores the present 
St possible change, going to the 
trajectory. Deciding which of these strategies is most 
the relation between C, Cy and A, as summarized in 
Table 4, diio 


5. Tue CONTROLLER Stu 


The controller uses only arithmetic 


simulated in FORTRAN IV. The 


LATION 


and logical operations, so it is adequately 
System is simulated by expressing its 
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TABLE 2. Ust or CONTROL SEQUENCE 


1” 


Choose Control Sequence (see Table 3.) 


no 
< RI value found? 
yes 
use R1 for RT1 
М Ж по ев 
is Cy 2031? ------зіз Cr already passed? н 
yes 
no no 
а R2 value found? 
continue using R1 
yes until C =0 
use R2 for R'T2 


"m C be stopped ng 


without overshooting 4? 


yes 


no 
R3 value found? 


yes 
use R3 for RT3—f 


ы 
use R4 for КТА 
de steady aS. 
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END 
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ing dis 
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TABLE 3. SELECTION OF CONTROL 


no С'=0 ? yes 
C moving away yes 
from A? С-А?-----хоп target 


can C be stopped without 
overshooting A ? 


Ri such that (Cp—A)>1/2 .CC, 


Ср- Му 


ҺТІ-ТК. RG 


R2 and R3 calculated 
(see Table 4) 


Cp—-R2 


RTS-TK . е 


R2 value found ? 


yes no 
C’=0? 
yes 
no 
R4 =A 
R3 and RT3 
calculated R'T4=t; 
R4 ‘=A 


RT4=1, 


— = — a i мд СЫ, T eee 
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к function as a time series and calculating the convolution of this with 
the M series. The program is in two parts: (I) which calculates the weighting 


function, and uses this to obtain the system parameters used by FRED; and (11) 
the simulation of FRED and the system. FRED and the methods for obtaining 
ogram I are independent of the actual weighting function, 


system parameters in pr 
for any linear slow-r -damped second-or- 


and will be effective 
higher-order system. 


еѕропѕе Over 


'lanLE 4. CHOICE OF M Умле TO TEST AS R2 


is Cp’ =0 


no change adaptation level 
calculate trajectory size 
in which Ср is ‘slow’ 
calculate trajectory size. 
in which A is at end of fast 
A is A in med4 or 5105 of yes 
this trajectory ? 


no 
is M on scale? 
Addition Tests 
yes no 
test ра ? . 
Bent test maximum М strategy 
yos no 
test 


Subtraction Tests 


test 
. „mplate. 
A Section position found using CC template 


Progr ggi 
fu ogram I This program calculates { ‘ed at 1 
tl nction H(t) of this simulation as 4 time series spec! à ue of the series is k, 
he system time constants 7; and 72- 
H(t) = exp (-і/т) 


апа 
(-1,2.-2 


т . : 1-0 
he series is terminated when H(t) «0:01. 
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Қ.) measured in seconds, are marked by the positions of the rate constants 
in the H(t) series (see Figure 3). The ¢ unit is taken arbitrarily as the shortest 
of the ¢ template sections, measured in seconds. The time template is re- 
measured іп ¢ units for use by the controller, although it must be expressed in 
seconds for use with the weighting function. There are a number of places in 
the programs at which measures in interface units must be converted to system 
values as the system is also being simulated. For instance, the rate constants 


give rates measured in 1 sec. units, which must be converted to interface time 
units. 


Program H This is the main program of the controller and system simulation. 
As both controller and system are simulated, the system cannot progress while 
the controller is making its decisions, which happens in real-life situations. 
Instead, the simulation is in two parts; one selects the control sequence, as 
described in Table 3; the other directs the system simulation using these values, 
checks system performance at the predicted times and, if necessary, returns to 
the first part to repeat the control value computation, as shown in ‘Table 2. 
System values are corrected to the nearest interface unit by 


b [integer value of (a + b/2)/b] 
where a is the system value and b the interface unit measured in system units. 


6. PERFORMANCE 
Two different studies were made of the controller's performance. 


On-line Study Іп this study program ЇЇ was used to control C after a single 
step change in 4. One of these runs is shown in Figure 9 and shows clearly 
FRED’s inefficiency when C is near to the target value, due to the limits on the 


choice of control value and the turn around parameters. This will be discussed 
later. 


H t unit seconds 


404 
304 


А204------- 


104 


lI ДО 
F Sample times 
16 i i i 
URE 9, Simulation of controller's performance in response to a step change in A. 
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Systei j 

i Сыма ipae e сыр state (SS[AS) Study In this study only the control 

me міт par of program II (Table 3) was used. It suggested a 

pe gi Жос жей eris pe of a present system state, i.e. (C, C") pair, for each 

given in T 4s (С; 4 ) values. The first value of each sequence for A=50 is 
able 5 (а). "Тһе underlying strategies are shown in Table 5 (b). 


TABLE 5 
ABLE 5 (a). : : 
(a). SYSTEM STATE/ACTION STATE STUDY: FIRST M VALUES SUGGESTED BY FRED 


Cy 
Cy -6 -4 2 0 2 4 6 
0 100 100 100 88 = = = 
10 100 100 100 80 88 88 3 
20 100 100 100 73 98 98 93 
30 100 100 100 65 68 79 64 
40 68 68 57 58 65 63 0 
50 66 6 57 50 42 34 33 
60 100 37 35 42 40 27 27 
70 36 21 3235 0 0 0 
80 ДЕ 2 22 0 0 0 
90 96 12 1220 0 0 0 
iü = € == 2 0 0 0 
to t=0. 


Cy measured from t= —h 


VaLUEs IN TABLE 5 (a) 


TABLE 5 (b). STRATEGIES UNDERLYING 


15-221 0 2 4 6 
|с 
0 not possible 
10 R2 "HEN 
change addition, абайюп, R1 
ladaptation Ср= siol. (emed 
ri 
end of (level | stop trom мор 
> le R {stop from | fast [ota 
20 scale RI |fast " | 
subtraction, 
joined at fast 
Stop from fast 
— =” Ж 
30 subtraction, 
joined at 
med 4 
stop m m 
ы Жі ені = 
90 iR2 change ( сла cf scale 
conservative adaptation R3 
level 
RI stop from 
Ші тей? - mE ы 
50 R4 censervative RI 


Only half of the full table is 8 
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This is parallel to a technique used by Cooke (1965) with human subjects. 
The system used in his experiments was of the type that FRED can control, a 
water-bath containing an electric heating coil. ‘The subjects had to change the 
water temperature to a given value and maintain it there by controlling the 
power supply to the heating coil. After a number of training sessions with the 
system, Cooke gave the subjects a sheet marked with a system state matrix and 
asked them to put in each cell the control setting that they would use if the 
system was in this state. He called the cells which were completed with the 
same control setting an action state. One of these completed matrices is given 
in Table 6. Тһе water bath heats more quickly than it cools, while rising and 
falling are the same in the system controlled by FRED, so care should be taken 
when comparing the two 55/455. FRED could easily be made to control such a 
system if it contained two sets of system parameters. 


TABLE 6. CONTROL SETTINGS CHOSEN BY EXPERIENCED SUBJECT FOR 
STATES 
System: Water bath with electric heating coil (90° goal) (Cooke, 1965) 


A SYSTEM IN DIFFERENT 


TEMPERATURE RATE OF CHANGE 


fast slowly " А slowly fast 

TEMPERATURE down down statlonary up up 

71 

72 

73 

74 

75 

76 

77 200 

78 150 

a 200 150 

M 150 150 

32 150 120 

83 150 120 

84 150 120 

si 200 200 150 120 

= 150 150 120 100 

е a5 150 150 120 100 

ER T 150 150 120 100 

89 | > 120 105 100 95 

90 1 10 ur 105 100 95 

x. m? $ rc 

92 100 95 50 0 

95 95 95 0 

94 0 0 

95 

96 

97 

98 
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FRED had to assume an Mp value when making its decisions in this task, 
as the future behaviour of C in the present trajectory is relevant to the control 
decisions. It assumed that Cy' was the maximum rate of the present trajectory, 
and that Cp was in the centre of the fast section. Presumably the human 
subjects made some assumption about the Mp value, but nothing is known 
about this; their assumption might depend on the Ср and (Cp to A) values, and 
this dependence will influence the comparability of human and controller 
„performance. 


| 7. DISCUSSION 
The two performance studies raise many similar points. 


Controller sensitivity Although a basic feature of this controller is that it has 
limited ability to distinguish different values of an internal variable, this limited 
accuracy does not appear consistently in the present version, and should be 
extended to all variables considered. Only the rate variable has been categorized 
explicitly; some other variables have, been categorized implicitly as their values 
are obtained from calculations based on categorized values. "This is the case 
with М values, which are chosen by FRED on the basis of C’. As only a few 
С! values can be envisaged, only a few distances can be used. The other 
possible values do not 'exist' for the controller, so it shows categorization 
behaviour, using a number of characteristic values for this variable rather than 
“making an explicit categorization of the continuum. 

Although FRED can only use discrete categories, the ratio and adaptation 
level mechanisms together allow variation in the absolute accuracy of its dis- 
criminations, as discussed earlier. ‘This mechanism is an important aspect of 
the controller as it allows it to change its control accuracy. Choice of R2 value 
depends on the (Ср to A) distance, as this determines the control trajectory size. 
This size in turn determines the C’ values considered, and the adaptation level, 
so that judgements based on AL are made more accurately, as C draws nearer 


to A. 
Basing decisions on single characteristic values means that individual М 


values such as 97, 98, 99 are suggested; it would be more reasonable to use 
characteristic regions which would suggest a control region on the M scale, of 
which the characteristic value would actually be used, e.g., 100 in the above 
cases. Тһе size of these regions would depend оп AL and so would be smaller 
when Су is near A. This suggestion is a reminder of the complexity required 
in a digital device designed to simulate a range of values treated as a unit. 


Minimum trajectory limits In the present system, tı and k/(1+w)? are so small 
that z is large, which suggests that the w used here may not be suitable. It 
might also be necessary to use fewer categories to obtain discriminable slol and 
med2. Cooke (1965) found that his subjects could discriminate five rate values. 
Alternatively, discriminable values might be obtained with the same w and a 


much slower system, which it would have been expensive to simulate while 


48 Lisanne Bainbridge 


debugging the controller program, or a higher order system, in which acceleration 
takes up a greater proportion of the ¢ template. 


The effect of using a constant proportion of the minimum trajectory as а” 
limit on control choices appears both in (C=40, C’= 6) and the mirror cell in the 
system state table, and in the final part of the on-line run, where it was not 
possible for FRED to choose an R2 value once С was nearer to A than 3/5, the 
limit on R3. Similarly R1 must cause turn around at least 2/2 from A, 17 units 
in this system, so conservative R1 values are chosen for С nearer to А than 17 
units. These limits were chosen arbitrarily and the controller’s performance 
suggests that a varying limit based on AL would be more efficient. 


Turn around parameters Тһе turn around parameters do not give good pre- 
dictions when compared with actual System behaviour in the on-line runs, 
especially at extreme values. This is the cause of the entry in (C— 10, С” —6), 
where the parameters predicted that C could not be stopped without over- 
shooting A, so a conservative R1 was chosen to stop C at 2/2 beyond A. 


When FRED is testing R2 and R3 values while choosing a control sequence, 
if the turn around parameters are smaller than those which correctly predict sys- 
tem performance, R2 values will be accepted that will actually cause C to over- 
shoot 4; while if they are larger, C will not have reached 4 when FRED looks 
at the effects of control. In both cases the approach to A will be inefficient. 
FRED also uses the turn around parameters during control, when checking the 
effect of using R2 (note 1 to Table 2). In this case, if the parameter is too small, 
FRED will make frequent samples and recalculations of its strategy; while if it is 
too large, C will always overshoot 4. 


А single parameter is obviously an oversimplified description of real system 
behaviour, but if the controller is modelling methods of prediction rather than 
attempting to make perfect predictions, this is not a relevant criticism. The 
method chosen has a great influence on the effectiveness of FRED's decisions 
and the pattern of M values in the 55/А5. The controller could make turn 
around predictions in other ways. For instance, it could assume a single value 
for turn around distance and time, independent of the forward and reverse 
distances, or choose R1 and R3 values which would give a standard turn around 
time rather than a standard turn around distance, or use an R1 value which gave 
a standard turn around distance rather than a standard distance of C, from А. 


The response to different rates at the same C value In the human SS/AS different 
rates are controlled by different suggested M values which change monotonically 
acrossthetable. Inthe controller, because of the nested nature of rate categoriza- 
tion, approximately the same M values are suggested in response to different 
rates on the basis of different Strategies. This in combination with uncategorized 
M values gives rather irregular numbers in Table 5 (a), although Table 5 (b) 
shows that the strategies change in a regular manner. ALs associated with these 
Strategies are different, so that some conservatism based on adaptation level 


weenn 
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might give a more realistic performance. ‘This similarity of responses might 
also depend in part on the turn around parameters, and on differences in Mp 
values assumed by the controller and the human subject. 

The response to different C values at the same rate Controller values are again 
not monotonic, although strategies change regularly. An exception is the 
relation between M and Cp for C’=0, which is linear. in the human SS/AS 
this relation is sigmoidal, with greater sensitivity to M values ав C is nearer the 
target value. Again, more realistic behaviour might be obtained from a con- 
troller with some conservatism depending on adaptation level. However, the 
success of FRED as a model for human performance would then depend on the 
modelling of conservatism rather than on the strategies used in its present form. 


8. CONCLUSIONS 

The purpose of this work was to develop a controller with some human 
characteristics, and thereby to investigate these characteristics and show that 
they imply others. ‘The basic feature of the controller is a device which can only 
distinguish between a limited number of values at one time but can discriminate 
at any level of accuracy if these values are determined by ratios and an adaptation 
level; if w and AL are involved in the processes of control decision then control 
accuracy is necessarily similarly affected. It would be useful to replace FRED’s 
more arbitrary features by data on human performance. For instance, do 
human controllers show this nested type of discrimination for continuous 
variables, and the limitations on prediction which follow from it? Do human 
controllers use simple categorized templates to make predictions? And if so, 
in what way do they use them; and do the templates determine the timing of 
their samples and corrective actions? This paper demonstrates that a device of 
the kind described can control a system, and so shows that it would be meaningful 
to investigate human control behaviour in terms of such a model. 
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TYPE I ERRORS AND THEIR DECISION BASIS 


Ву К. S. RopGER 
University of Sydney 


It is proposed that an error-rate I should be set for each statistical decision 
to be made. Methods for testing statistical hypotheses in the context of analysis 
of variance are reviewed and evaluated from the decision-based error-rate view- 
point. If decisions are the proper bases for the choice of an error-rate, the 
available methods for testing any or all linear contrasts or linear functions of 
the means are unacceptable. Two new methods (R and D) are proposed for 
testing all contrasts or linear functions. One implication of the arguments given 
here is that, for most purposes, research workers have been using the wrong 
tables of F to test their statistical hypotheses. 


1. INTRODUCTION 


Ryan (1959) brought to the notice of psychologists the question of the 
basis for the choice of an error-rate. He argued that the experiment should be 
accepted as the basis. А single error-rate should be chosen for each experiment, 
this being the probability of making at least one false decision. But the definition 
of an experiment is usually arbitrary and the experimental limits variable 
from occasion to occasion. These are practical grounds for questioning the 
unqualified acceptance of experiment-based error-rates. Theoretically, the use 
of the experiment as the basis for the choice of an error-rate seems arbitrary 
and unnatural, ‘The bases for choosing error-rates must surely be the occasions 


when errors may be made—decisions. 

This point of view applies to testing a 
three cases are distinguished later in the pape 
hypotheses, goodness-of-fit tests and testing an unlimited number of hypotheses. 
These methods are elaborated solely in terms of one-way, univariate analyses 
of variance (with fixed treatments), but the basic idea of decision-based error-rates 


П types of statistical hypotheses and 
r—testing a prescribed set of 


` applies in all statistical contexts, in multi-classification designs and multivariate 


models, in the analysis of attributes by their frequency of occurrence as well as 


in the analyses of normal variates. 
It will be assumed that there are а treatment groups or populations with 
means py and sample means М; (j= 1,2,...,a). Samples will be considered 


to be random with equal replication sizes л, and to have normal error. 


2. CONTRASTS 
Тһе most frequently tested linear statistical hypothesis about the means 
of normal variates can be expressed as a contrast. The general form for the ith 


contrast is 
к= са + Cioka + eee + сана = 0. (1) 
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The coefficients c; are chosen by the investigator because of their relevance to 
the purpose of his study. The restrictions on the choice of the ci's are that 
they be real numbers, not all zero, such that Усу = 0. 


J 
If, for fixed 7, one value of су — 1, another су= —1 and all other су = 0, the 
contrast is a simple difference between two means and is termed a comparison. 
"Гһе contrasts 


к-ш-рә-0) x, 0 5(u pg) Rs 705 
and 
Kg pa o + Hg 7 3,705 
for a —4, have contrast coefficients and sums of squares 


¢y=1, —1, 0, 0 and >e = 2; сәі-0:5, 0:5, —1, 0 and Yeo? = 1:5 
j 1 


апа 
€35,71, 1, 1, —3 and Усу? = 12. 
2 
A component of variation is а second-degree expression which is equivalent 
to a contrast, since a test of the component will always yield the same decision 
as a test of its corresponding contrast. ‘The 7th component is 


Atc (У сии) ГУ су? тк [Y ci? =0. (2) 
2 j $ 


If several contrasts are specified for the same population means, the 
relations between them can be described by their intercorrelations. "Гһе corre- 
lation between any two contrasts к, and к; is 


раз Xie] Ус). (3) 
1 1 1 


When р; =0 the contrasts are orthogonal. When p,.2=1, the contrasts will 
be termed identical contrasts because their components of variation are identical. 
When 0<p,."<1, the contrasts are similar though different. The three 
contrasts quoted above are mutually orthogonal. Contrast к, is identical to 
ра tHe—24,=0 and ку is similar to p; —,:5—0, their correlation being + 0-866. 
For any contrast к; we may choose any real number w; and write v, to obtain 
an identical contrast. 


When а>2 an infinite number of different linear contrasts are possible 


among the a means. If the contrast coefficients for p contrasts are written as 
the elements of a matrix pCa, the rank of this matrix cannot exceed (а- 1), по 
matter what the size of р. This indicates that any contrast can be expressed as 
the weighted sum of a set of (a— 1) contrasts, the coefficient matrix of which has 
rank (a—1). It is for this reason that there are (a—1) degrees of freedom 


available for contrasts among а means. The sample estimators of contrasts and 
components of variation are 


ki= ci M, co M ,+ we. c аЙ; (4) 
and 


Li-n(Y. cuM;j2|y ei = пу сїў. 6) 
j j $ 


М 
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The component estimators corresponding to кү and к above аге 

І-т(М,- Msy|2 
апа 

La =n[0-5(M; + М) — Мәр 1-5 = [(51, + M3)/2 — Ма 12) + (1/п)]. 

If each of these were divided by the estimate of error variance, they would be 
identical to the squares of ¢ tests. 

If p=(a—1) contrasts are specified, with coefficient matrix rank (a — 1), the 
corrected sum of squares between sample means (SSB) can be expressed in 
terms of these p contrasts. Let E, be a vector, the elements of which are the 
sample values of the p contrasts. Then 

SSB = к, (Са aC'y) 1 pki- (6) 
If the p contrasts are mutually orthogonal, („Са аС') is a diagonal and 
p 
SSB- X Li. (7) 
i=l 

Contrast coefficients may be chosen by the investigator either indepen- 
dently of the data used for testing purposes or dependent upon them. Which 
course the investigator chooses partly determines the appropriate sampling 
distribution for testing the contrasts: An interesting example of the second 
approach is the maximized, observed contrast, the coefficients of which, for 
a> 2, can be determined only from the data. The coefficients for the maximized 
contrast (кө) are 

Cm = M, — M., ca = M,— M., ... cma = Ma — M., 
where M. is the mean of all samples combined. For this contrast 
Іһ-әу(М,- М.) = пену Mj [E ens? = SSB. (8) 
j j j 
asts is the mean-square within treatments 
(MSW) in analysis of variance, which is an estimator of the population error 
variance o and has та-а(п- 1) degrees of freedom. А number of different 
sampling distributions may be used to test contrasts and these will not generally 
give identical decisions. The investigator must choose, independently of his data, 
which sampling distribution will be used.. The emphasis in this paper will be 
on the variance ratio distribution and the studentized range distribution for 
The general form of all tests may be written а, 


F-Li|, MSW | (9) 


The basic error term for contr 


testing contrasts. 


or 
qœ =4nkè (Eleu) MSW (10) 
j 
ed range distribution for range vı + 1, with error degrees 


where q is the studentiz | 
the variance, ratio distribution with v, and v, degrees of 


of freedom vs, and Fis 
freedom. ` f f 
ypotheses which are not contrasts are sometimes tested. 


Linear statistical h 
These are also of the form given by eqn. (1) but with Scy#0. Thus, for 


54 R. 5. Rodger 


a=4, it may be asserted that the average иу=0 by choosing coefficients 
Сој= 1, 1, 1, 1 and Yeqj?=4. Such hypotheses may be treated in exactly the 

j aq. : sai Š 
same way as contrasts, as described above. This particular hypothesis is 


orthogonal to each of the three contrasts specified earlier, as substitution of the 
coefficients іп eqn. (3) will show. Another of many possible orthogonal sets is 


IT 1 bb $3 


| 


| 

1020 1 || 
бе | | 
1-1 0 0| 

10 0 м 


A matrix of coefficients pCa, which contains at least one row with Хз 0, 


2 . 
cannot have a rank greater than a, no matter what the size of р. Such a matrix 
with rank a has a degrees of freedom available for tests carried out upon its 
corresponding components. Р 

If the uncorrected sum of squares between observed means (USSB) 18 


used rather than the corrected sum of squares (SSB), a matrix „Са with p=4 
and rank a will give 


USSB =mkp'(pCe С), (1) 
which is the rank a version of c 


qn. (6). If the a components are mutually 
orthogonal then 


USSB=5 L; (12) 
ізі 
which is the rank a version of eqn. (7). 
There is no absolute requirement, either fo, 
hypotheses, that the null case be s 
hypothesis is 


contrasts or other сағ 
pecified or tested. "Тһе most general lined 


Къ Cia... +Craptatdi=0, : 
and d; need not be zero. Тһе Sample estimator for the component which 
corresponds to this expression is 

Li-n(Y.egM; ау Уусу. m 
j j 


Тһе general form of the test for such 


s. а т le 
П а component is still eqn. (9). ‘The samp 
estimator of к; is 


hi- es M, esM, cb 6iuMa + di, (0) 
and this тау be used іп eqn. (10). 


3. PRESCRIBED TESTS 


The statistically most powerful Way to test contrasts or other linear pe 
ponents is to select a or fewer of them independently of the date used to të 
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them, hence preferably before such data are collected. If p<a are selected, 
their coefficient matrix having rank p, the components of variation for each may 
be inserted in eqn. (9), with v, — 1, and each component rejected for which the 
resulting F> F, , from the standard tables, with », — 1. These tests are exactly 
equivalent to ¢ tests and a is the type I error-rate for each decision made. This 
strategy uses the shortest possible confidence intervals for x;, and treats the 
experiment as p sub-experiments, designed as a unit for greater efficiency. 
This is a method recommended by Fisher (1951, pp. 198-203). 

If a mutually orthogonal components are selected and the probability of 
rejecting each in error set at o, the probability that each will be retained correctly 
is(1—a). These а probabilities are statistically independent; so the probability 
of retaining all components correctly is (1—a)'. The probability of not retaining 
all components correctly, i.e. rejecting at least one in error і81-(1-а). This 
is the experiment-based error-rate. If the selected components are not-mutually 
orthogonal, the probabilities (1- а) are not statistically independent and the 
experiment-based error-rate may be less than 1-(1-о). Even for mutually 
orthogonal contrasts the decisions that are made will not be mutually independent 
if all tests аге performed with a common MSW. Such dependence may be 
removed by splitting the sum of squares within treatments into а parts, at 
random, and using each of these parts to provide independent estimates of сг, 
each with (n — 1) degrees of freedom. Each of the a tests will now use a different 
MSW. In practice the use of a common MSW seems to have a negligible 
effect on the distribution of decision errors. 

It is true that as a increases, the probability of rejecting at least one com- 
ases: but this is equally true if we adopt the much less 
efficient scheme of keeping a small and increasing the number of experiments. 

If only contrasts are to be tested by the methods of this section, then a 
should be replaced by (a —1) іп the last two paragraphs. 


ing estimated components іп eqn. (9), the 


As an alternative to inserti 
estimates k; may be inserted in eqn. (10). If the square-roots of the resulting 


values of q? are extracted, each will have the studentized range distribution for a 
range of two with vy=a(n—1) degrees of freedom for error. Each contrast or 


linear hypothesis for which the observed q> qa should be rejected. Here а is 
the type I error-rate for each decision to be made. This test will give identical 


s to those arising from the use of eqn. (9). 


ponent in error also incre 


decision 


» 4. Goopness оғ FIT 


It occasionally happens that an experiment is designed to test the goodness- 
of-fit of observed data to some theoretical statement. А simple example is the 
hypothesis that all py — 0. This may be tested by setting USSB=L and 
inserting this in eqn. (9), with w =a. If the resulting F>F,_, for v,—a and 
vy=a(n—1), the hypothesis 1s rejected. The type I error-rate for this decision 
is а, This is a strictly one-decision procedure. Should further tests on the 
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that all jj are equal. This is tested by inserting the observed value of Lm in 
eqn. (9) with v, 2a —1 and rejecting the hypothesis if the resulting F> F 
¥;=a—1 and və=a(n—1). This is again a strictly one-decision process with a 
being the type I error-rate for that decision. 

In spite of the fact that the last-mentioned hypothesis (equivalent to 
Ag — 0) is very frequently tested, the view taken here is that it is only rarely 
an hypothesis of primary interest in empirical research. The evidence for this 
view is the high frequency with which predictable contrasts are tested in data 
for which Am — has been rejected. If Aa, = 0 is true, then all ^;— 0 are true. 
Conversely, if all A;=0 are true, then А, —() is true. ‘These two directions of 
logical argument have critically different Statistical Properties and it is essential 
to specify which terms are the premises and which the conclusions in an investi- 
gation before devising the appropriate statistical tests. When the A;=0 are 
primary, as they usually are, the methods of Sections 3 or 5 are appropriate for 
testing them, 

The goodness-of-fit Strategy is appropriate in some curve-fitting applica- 
tions. In such studies, levels of the treatment variable (аа, MM „ Ха) 
occur in ап hypothetical equation which relates them to the population means 
(ш). An example is the linear regression equation 


M= +e Xj. (15) 


If we write бу= Ху— X, where X is the mean of the Y 


's, the cy are real 
numbers, not all zero, and Усу = 0. 
j 


The су are the contrast coefficients for 
the linear trend component 


Ai =n(>x ину) [У eg? = 0. (16) 
7 " 


If the true relation between the Hj and Лу is given by eqn. (15), the variation 
of the M; around the trend line will be random and normal with an expected 


mean of zero. The observed amount of variation of the Му around the trend 
line may be measured by 


Dye. (17) 


If eqn. (15) is the only true опе, then A,=0 is false and A= is true. Should 
the former be retained or the latter rejected, сап. (15) must be discarded. 
A¿=0 may be tested by the methods of Section J. We test Ar=0 by 
inserting L, in сап. (9) with v, =(a—2) and reject it if the resulting F> Fv 
from the standard tables, with the Stated v, and vy. Once more « is the type I 


error-rate for this decision and further tests on the same data are not carried 
out, no matter which decision is made, 
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5. UNRESTRICTED TESTS 


It frequently happens that investigators wish to test more than (а- 1) 
contrasts, or more than а components at least one of which has Ус: 0. It is 


j 
also common for investigators to scek to reserve the right to search among their 


obtained data to find relations among the ну which can be interpreted. Exploring 


data can lead to abuses, one of which is the testing of sets of retained and rejected 
contrasts, with no systematic attempt at empirical rationalization or interpreta- 
tion, and another is exaggerated claims to logical status for empirical interpreta- 
tions placed on the decision sct without adequate support from prior or subse- 
quent research. ‘These abuses, though serious, are not especially statistical 
problems. The principal statistical problem is to provide a method for testing 
more than the prescribed number of components specified in Section 3, or for 
testing any components suggested by a scrutiny of the data with known error- 


rates o. 
Whenever p components are tested, the coefficient matrix of which has 


rank less than p, there exists the possibility that intransitive decisions may be 
made. An example of intransitive decisions is 

ш 71a 705 ш-нз0 and pa Hs 0. 

If the first two decisions аге made, then ш =: =з and ш-н. The 

ds introduced in this section do not prevent 


methods reviewed, and the new metho 
ft to the investigator to choose the decision 


such intransitivity happening. It is le 
sct or sets he will interpret. 

For many years it was the practice to insert Lm in eqn. (9) with v, —a— 1, 
and, if Am=0 was rejected, insert the various Li of interest in eqn. (9) with 
v,=1. This is the method of protected / tests. This procedure has been known 


to be unsatisfactory for over 30 years. Among other things, it will yield a 
distribution of decision errors which are not random in Aj; the decision errors 
being concentrated in the experiments in which Am=0 is rejected, and being 
greater than 100x per cent in these experiments. 

Scheffé (1953) suggested that any component of interest should be tested 
by inserting its L; in eqn. (9) with », = (a — 1), and rejecting A,=0 if the com- 
puted F> Fiv This method uses the experiment as the basis for its error-rate 
choice, rather than the multiple decisions which will be made. The experiment 
as the basis for error-rate choice is traditional but in the author’s opinion 
incorrect where multiple decisions are to be made. 

It is proposed here that the Scheffé procedure be modified simply by using 
tables of Faq, to test the F of eqn. (9) instead of the traditional tables of 
Bus Tables of F^ have been provided by Rodger (1965). " 

The justification for this new procedure 18 that (1-ау” is the minimum 
probability of correctly accepting all contrasts, or other linear components, cach 
set at (1— «). If two components are tested with күре 1 error-rate a, the 
probability that both will be correctly accepted is the probability of accepting 
one, (1 — a), multiplied by the conditional probability of accepting the other. 
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accepting the second correctly, given the Correct acceptance of the first, is 
(1—a) and the probability of accepting both correctly is (1— а) When the 
two components are identical, such as кү and ky=w,x,, the conditional proba- 
bility of correctly accepting the second is not (1-а) but 1, and the probability 
of correct acceptance of both is not (1— а)? but (1-а). Any component іп 
excess of a set of Ур With coefficient matrix rank x, is perfectly predictable from 


отек ка... +o, к 


құты” 

The conditional probability of accepting any such component, given the 
acceptance of the y, others, is therefore 1. The probability 
all components is the Product of a sequence of Probabilities, those beyond the 
vıth being 1. This Product has a minimum value (1-а) 
consists of mutually orthogonal components. The 

rejecting at least Опе component in error (Am=0 in t 
1 =(=. The Probability of rejecting Am=0 in error will be 1-(1-а)», 
if We insert Li=Ly in eqn. (9) and evaluate this against Fy) n. The sampling 
distribution of Ly is F with и — (4 — 1) and "o7 (1 — 1) degrees of freedom—as 
1s well known, With a Strategy of making unrestricted tests there is no need 
st, but testing it will generally 


to test Aj, — 0, a component of little empirical intere. 

save time, since if An=0 is accepted all other observed values of F from eqn. (9) 
will be less than Ра ун. This may be used to test components which are not 
contrasts by Setting y, =a. 

Scheffé defined his test Criterion as S-(uPF, y. 


We shall redefine this 
to be 


S= (FL cu? MSW [ny (18) 
: А E 
which is the size of the confidence interval for a ther linear expres- 
sion k;. This is a useful expression which may 


ay be calculated Опсе, then used 
to search among the observed values of ki which have a common У суз Тһе 
Усу. 


Contrast, or о 


new method introduced here can 
interval length, which is 


Ri- (Fe as Усу? MSW [ny., (19) 


2 
The decision rule can be reformulated in 


к : 
also be expressed in terms of а confidence 


terms of this quantity as, 
Reject к; if РУЛ (20) 
If we employ this rule the Probability that 
component) will be rejected in error cannot exceed 1~(1-a)", 
Duncan (1955) Suggested using Бау" but he probably intended that this 


be used to test all comparisons in 2 nested fashion in place of his DMR method, 
which will be discussed later. 


Tukey (1953) Suggested Using the Studentized range distribution as the 
criterion for unrestricted tests. This method inserts the observed value of any 


at least one contrast (or other linear 
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contrast k; in eqn. (10) and rejects x; if the resulting 4>41-, which is the 
studentized range value for a range of а. If we modify Scheffé’s notation once 
more, and define 

T i= Уеа. (МУ а), (21) 

i 
the decision rule proposed by ‘Tukey is 
Reject x; if |i|> Ti- (22) 

As was true of the S technique, the T method uses the experiment as the basis 
for error-rate choice. "Тһе method сап be modified to give the error-rate a 
decision basis in the case where unrestricted tests are to be made by using 


Di= VEleulo-a (MSW/n)! @) 
2 


and the decision rule 
Reject x; if [ki] > Di. (24) 

Duncan (1951) uses фу „ун іп a test. procedure (DMR) for unrestricted 
comparisons, but this is not identical to the D method proposed above. Exten- 
sive tables of q, , and qu, have been produced by Harter et al. (1959), ап 
abridged version of q,_, by Harter (1960 а) and of qq.» by Harter (1960 b). 
Abridged tables of both test values are given in Rodger (1965). 

D and T are simple range methods, but multiple-range methods have also 
been proposed to solve the unrestricted tests problem. Newman (1939) and 
Keuls (1952) have developed a procedure first mentioned by Student in a letter 
he wrote to E. S. Pearson in 1932 (see Pearson, 1938). This technique arranges 
the observed means in increasing order of size, say My, М... Ma, and tests 
comparisons between them from the outermost inwards, in a nested fashion. 
Тһе interval ; 

KMR; = 4, (MSW /n} (25) 
being the critical studentized range statistic for a range of 


is set up with фа 
a vo=a(n—1). The outermost 


h+1 means, with error degrees of freedom 
comparison (ua —4t = 0) is rejected if (Ma — My) reaches or exceeds КМКһ for 
h=a—1. If this comparison is retained, all are retained. If it is rejected, the 
two comparisons inside it (ма —plg=0 and pa. 3 = — 0) are tested against KMR; 
for h=a—2. If either or both of these are rcjected, the comparisons inside 
either or both are then tested against KMR; with h=a—3. The process 
when all comparisons for fixed / are retained. 


terminates 
KMR uses a variable basis for error-rate choice, the range а being first, 
Hien a= 1, 2,2. The first of these uses an experiment rather than a 
decision as a basis. Duncan (1951) modified this to a decision basis by using 
DMR, =q, (МУ) (26) 


This is still a multiple-range test and uses a variable 
oice-—a —1 decisions, a —2 decisions, а —3 decisions... 
ber of decisions made directly at each level is the reverse 


to reject comparisons. 
basis for error-rate ch 


1 decision, yet the num 
of this, 1.е., 1,2... a—3,a—2 and a—1. 
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Multiple-range methods are for comparisons and may not be used for other 
types of contrasts. They have some affinity with the protected ¢ test, not only 
in the modified basis for error-rate choice but also because of the rule that no 
comparison may be rejected, even though its А2 КМК, it is contained 
within another which is retained. 

The methods for making unrestricted numbers of tests recommended here 
are either R or D. These two methods will not always reject the same com- 
ponents and the investigator must choose which he will use independently of 
the data. 

The relation between these and other methods may be clucidated by 
comparing the lengths of the confidence intervals for к; which each uses. These 
lengths are shown for comparisons with a=5 and a=10 in Table 1. The 
multiple-range methods use varying interval lengths but, if we use their average, 
weighted lengths, the increasing order of length is z, DMR, D, KMR, R, T and S. 
When contrasts other than comparisons are used, the order of lengths changes. 
This can be seen in Table 2 which gives the lengths when а=5 and a —10 for 
the contrasts 

ку=щ tua — 2u =0 
and 

Kap + 3u — 24 — 2m4 — 2, — 0. 
Тһе order now is, roughly, 2, R, D, S, and T. depending to some extent on a 
and the contrast being tested. 

Тһе differences in confidence interval length between the techniques in 
Table 1 or Table 2 are due to the different sampling distributions they employ 
and how the decision error-rates are assigned. For S the probability is а = 0-05 
that the maximized contrast will be rejected in error and T assigns this error-rate 
to the maximum observed comparison. R and D assign 1-(1 —0-05)" as the 
maximum probability of false rejection for at least one contrast (R), or com- 
parison (D). РМК and КМК use a variable basis for error-rate assignment. 


6. Summary 


It has been proposed tliat a type I error-rate а be set for each statistical 
decision to be made. In the simple analysis of Variance any linear component 
among the а means (.A;=0) may be tested by F= Lij MSW. гаа, ig 
ponents are predicted, with coefficient шайга rank p, ше m 4. i 
9i — 1 and we reject the component if F> P, , for the pary at p ымы: 
va a(n — 1). This is the ¢ test procedure and x is the type I error-ra ae a ч 
of the p decisions. When a component with p degrees of freedom is tested 
(such as Am=0), and no resolution of this component into parts is intended, 
the test formula has 7 —p and we reject the component of F> F, , for degrees 
of freedom v, and və ‘This is a goodness-of-fit, single-decision procedure, the 
type I error-rate for this being a. 
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When more than а linear components (or a—1 contrasts) are to be tested, 
we set v, —a (or v, —a—1 for contrasts only) and reject each component for 
which F> Fa 4 with degrees of freedom vı and vy. When more than v, 
decisions are made, the set may contain intransitivities. Such intransitivities 
may be removed by reducing the set (post-hoc) to a set of v4, the coefficient 
matrix of which has rank ғ, In particular, a set of v, mutually orthogonal 
components will remove such intransitivities. 

Тһе studentized range distribution may be used for any of the above 
procedures. This distribution will yield identical decisions to F in the first 
strategy but not, generally, in the other two. The investigator must select, 
independently of his test data, which distribution he will use. 
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THE ASSUMPTION OF MULTIVARIATE NORMALITY 
Ву R. S. LOCKHART! 


University of Sydney 


Although the multivariate normal assumption is common to many of the 
multivariate statistical methods used to analyse psychological data, its tenability 
is rarely examined. It is possible to derive a goodness-of-fit test of this assump- 
tion, but in practice sample sizes will not be large enough to make complete 
testing feasible. In such a situation a partial testing procedure is advocated, 
together with studies of robustness. A numerical example is given illustrating how 
the partial testing procedure may detect instances of distributions which have 
normally distributed marginals but which are not multivariate normal. 


1. INTRODUCTION 


The assumption of multivariate normality is common to many of the 
multivariate statistical methods applied to psychological data. Despite its 
ubiquity there is no method in common use whereby such an assumption can 
be tested; the consideration of whether an observed set of data conforms to the 
multinormal model is usually limited to an examination of the normality of the 
marginal distributions. It is well known, however, that marginal normality is 
not a sufficient condition for joint normality, and it is an easy matter to construct 
distributions which have normally distributed marginals but which are not 
multivariate normal (Carroll, 1961, p. 360). More generally, while it is true 
that if an z-variate distribution is joint normal any subset of the z variates will 
have a multivariate normal distribution, the converse does not necessarily hold. 
In particular, the fact that the z(n-1)/2 bivariate distributions of an z-variate 
distribution are bivariate normal does not ensure that the distribution is 7-variate 
normal. An examination of bivariate scatter-plots is therefore of limited value. 

As Binder (1959) has pointed out in the context of the bivariate normal 
model for correlation, there is much to be gained by examining data to see if 
certain assumptions are tenable. ‘This is particularly true in the case of the 
multivariate normal model, since it constitutes the basis of most available 
multivariate statistical methods (see, e.g., Anderson, 1958). Two examples 
which are of particular importance to psychologists are multiple-correlation 
methods and the statistical procedures which have been developed in association 
with factor analysis. With respect to the former it is important to note that if 
the distribution is not multivariate normal, it follows that either the regressions 
are non-linear or the variances of the conditional distributions about the 
regression lines are not homogeneous.” As far as factor analysis is concerned, 


nnsylvania State University. 
orem see Kendall and Stuart (1961, Vol. II, pp. 352-354). Тһе 
bles are mutually completely independent or they are functionally 
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the whole range of maximum likelihood estimation and testing procedures now 
available (see Lawley and Maxwell, 1963) are based on the multivariate normal 
model. 

Given the fundamental role of the multinormal model in multivariate 
statistics it is important to enquire into the feasibility of devising some means 
of testing the tenability of this assumption for a given set of sample data. 


2. DERIVATION OF A POSSIBLE Goopness-or-Fir ‘l'Est 

The multivariate normal distribution possesses a number of distinctive 
properties which in principle might form the basis of a goodness-of-fit test for 
sample distributions. "The general difficulty is to fi 
will lead to a satisfactory test and which will re 
three variates. Examination of the higher order moments, for example, presents 
itself as one possibility. Under the null hypothesis of multivariate normality 
these moments are fixed functions of moments of the first two orders. Apart 
from the problem of the sampling distribution for these higher order moments, 
the number of such moments that would need to be considered in any 
test rapidly becomes prohibitive as the number of variates increases. 

There is, however, one well-known property of the normal distribution 
which leads to a possible test and which readily generalizes to any number of 
variates. 


Consider the set of p variates represented by the column vector x or the 
row vector 


nd such a property that 
adily generalize beyond two or 


reasonable 


x —[xxj...xpy... Xp] 
and let 
E(x)-w = [une - ш... pp] 
be the row vector of their respective means. Let V bethe b x p variance-covariance 
matrix for the p variates. If the joint distribution of x is multivariate normal 
its probability differential may be written as 
dF cexp{—3(x—p)'V-'(x—p) HI dx. (1) 


Then there exists an orthogonal matrix, H, such that the transformation 


x= Hy (2) 
yields 


H'V-'H - D, 
where D is a diagonal matrix? Since H is an orthogonal matrix we have from 
eqn. (2) that 


y-H'x. (3) 


? This is a well-known theorem іп matrix £ 
1 atrix algebra (see | 
p. 347). I am indebted to Prof. H. O. Lancaster of the Univer: 


relevance in the Present context. 'The theorem is in fact the ba 
component analysis. 


ndall апа Stuart, 1963, Vol. 1, 
of Sydney for pointing out its 
sis of the method of principal 
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The transformation matrix required to make D a diagonal matrix is the 
matrix of eigenvectors of V-! (which are also the eigenvectors of V). D is 
then the diagonal matrix whose elements are the eigenvalues of V. It is to 
be noted that D—L-! where L is the diagonal matrix of the eigenvalues of V 
with typical element /;, say. 

Substituting eqn. (2) in eqn. (1) it is easily shown that the latter becomes 

dF ccexp (— (у - H'i) D(y —H’y) I dy. 
Let р = Н'р be the column vector of the transformed means. Then replacing 
D by L-! we have 

dFocexp(— (у — u*)'L(y —и®)}П dy. 
Since L is diagonal this may be rewritten as 


dFoc Пехр{— n QE May. (4) 
(і-1,2...р) 


Thus under the hypothesis of multivariate normality the eigenvector 
transformation given by eqn. (2) yields a set of normally and independently 
distributed variates, yi ‘The distribution of the yi will have means pi* and 
variances /;. 

To test the hypothesis of multivariate normality one might first examine 
the original marginal distributions for normality, since if this condition is not 
met the distribution is clearly not joint normal. Supposing, however, that 
these marginals are found to be normal or are rescaled to make them normal, 
the hypothesis of multivariate normality could be tested by replacing V with 
the sample covariance matrix, applying the transformation of eqn. (3) and 
testing the resulting distribution for marginal normality and joint independence. 

This latter testing can be accomplished employing the usual multiple 
contingency chi-square procedures. Under the hypothesis of marginal 
normality (of the уг) and complete independence, the total chi-square con- 
stitutes a single test of the null hypothesis of multivariate normality. 

A difficulty which immediately arises is that unless the number of measures 
on each variate is very large, many of the expected frequencies for the calculation 
of the total chi-square will fall below unity as soon as the number of variates 
becomes at all large. Suppose, following Mann and Wald (1942) and Gumbel 
(1943), we select marginal class intervals so as to equalize expected frequencies. 
If there are N such intervals on each marginal, then in order that the expected 
frequencies in each cell be greater than or equal to a we would require 2 =aN? 
measures on each variate. Even with as few as four class intervals on each 
marginal and a minimum expected frequency of five, the required sample size 
rapidly becomes very large. For three variates it is 320, for four variates 1,280. 
This difficulty is only in part a result of the asymptotic nature of the chi-square 


distribution as applied to contingency analysis. It is largely a consequence of 


a difficulty that would confront any goodness-of-fit test; the number of degrees of 
S.P. 


Е 
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freedom on which an adequate test must be based rapidly exceeds the bounds 
of a reasonable sample size. Unless there are many observations constituting 
points in the multidimensional sample space only the grosser features of the 
underlying continuous population distribution function can be determined. 
Тһе problem becomes somewhat analogous to testing the normality of a uni- 
variate distribution on the basis of only a few observations. 


3. Discussion 


Since in psychological data the number of observations is typically small 

relative to the number of variates, it is clear that in most cases an adequate test 
of the multivariate normal assumption will be virtually impossible. One is 
then placed in the highly undesirable (although frequently practised) position 
of making statistical decisions, the validity of which rest on an untested 
assumption. In the face of this difficulty two policies are advocated. 
(i) A Partial Testing Procedure Ап incomplete test can be accomplished by 
applying the transformation (3) and then carrying out as much of the con- 
tingency analysis as the sample size will permit. In most cases it will be 
feasible to test only the normality of the transformed marginals, although by 
collapsing intervals, pair-wise, (and possibly higher order) independence might 
also be tested. It is suggested that this partial testing will detect most instances 
of multivariate distributions which have normal marginals but which depart 
seriously from joint normality. A numerical example of this partial testing 
procedure is given in the final section of the paper. 


(ii) Studies of Robustness ‘The second policy advocated is that investigations be 
conducted into the robustness of relevant statistical methods against violation 
of the multivariate normal assumption. In cases where an adequate test of a 
distribution assumption is not possible the issue of robustness becomes one of 
primary importance. An example of such a study is the recent work of Fuller 
and Hemmerle (1966) on the robustness of the maximum likelihood estimation 
procedures in factor analysis. 

One of the difficulties associated with studies of robustness is the large 
number of alternatives to be considered. Violation of the multivariate normal 
assumption could occur in an indefinite number of ways, and in the multi- 
variate situation it is difficult to specify the kinds of violations most likely to 
occur in practice. The principles from which the test described in this paper 
has been derived provide a possible basis for a systematic study of robustness. 
Violation of the multivariate normal assumption implies that the variates after 
being transformed according to eqn. (3) are either non-independent or have 
non-normal marginals. ‘The sensitivity of a given procedure to violation of 
each of these implications could be assessed. 

In particular, variations in the form of the transformed marginal distributions 
might be systematically studied. Furthermore, each of these distributions is 
associated with an eigenvalue (its variance) which represents a certain proportion 
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of the sum of the variances of the original p distributions. In most psychological 
data, especially ability measures, there will typically be a considerable discrepancy 
between the largest and the smallest eigenvalue. It seems reasonable to suppose 
that the non-normality of a transformed marginal distribution associated with 
an eigenvalue which accounts for a high proportion of the total variance repre- 
sents a more serious departure from assumption than does that of a distribution 
associated with a relatively small eigenvalue. 


4. NUMERICAL EXAMPLE 


The partial testing procedure based on the method outlined above will be 
illustrated with a simple bivariate example. "This will also serve to show how 
the procedure may detect a distribution which displays marginal normality, yet 
diverges quite markedly from joint normality. 

A set of 1,000 contrived Scores on each of two variates were standardized 
and grouped in intervals of 0-5 of a standard deviation. Тһе joint frequency 
distribution is given in Table 1. The marginals of this distribution conform 
exactly to the normal curve. Routine calculation gives the following results 
0-7071 0-7071 1, = 1:463 
0-7071 0-7071 Ly = 0:537 

The scores were then transformed according to eqn. (3) and for convenience 
of presentation were standardized and grouped in the same class intervals as 
the original distribution. The marginal frequencies of the transformed dis- 
tribution are given in parentheses in Table 1. Тһе application of a chi-square 
goodness-of-fit test to these transformed marginal distributions yields the 
results summarized in Table 2. The hypothesis that these distributions are 
normal is clearly untenable. Thus the hypothesis that the original distribution 
is bivariate normal must also be rejected. 


TABLE 2. CHI-SQUARE Goopness-or-Fir "TEST Fon THE MARGI 


NALS OF THE TRANSFORMED 
DISTRIBUTION 


Transformed Eigen- 
Marginal value x à f. P 
1 1:463 28-1 9 <0-001 
2 0-537 155-6 9 « 0:001 
Total 183-7 18 <0:001 
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CLUSTER ANALYSIS OF NON-VERBAL JUDGEMENTS OF FACIAL 
EXPRESSIONS 


Ву PETER STRINGER 


University College London 


ents of facial expressions have used either 
or non-verbal multidimensional techniques. 


Certain disadvantages in the methods so far used are described, and a simple 
alternative procedure is tested. Data from a ‘ free-grouping ' task are analysed 
by a clustering method which is congruent with the assumption that the primary 
response to facial expressions is in terms of contextual or functional similarity. 
The principal disjoint clusters obtained are described in detail. The meaning 
of the clusters and of their interrelationships is discussed. It is suggested that 
there are wider applications both of the * free-grouping ' procedure and of cluster 


analysis. 


Previous researches on judgem: 
verbal scales, pre-selected or elicited, 


1. INTRODUCTION 

In recent years the most noticeable body of research on the judgement of 
facial expressions has been devoted to elucidating the dimensional theory first 
proposed by Schlosberg (1952, 1954). It has been shown that subjects’ responses 
to emotional expressions can not only be meaningfully interpreted in terms of 
three dimensions (pleasant-unpleasant, attention-rejection, and sleep-tension), 
but that subjects can also reliably locate these expressions on scales corresponding 
to the dimensions (Engen and Levy 1956; Engen, Levy and Schlosberg, 1957, 


1958; Triandis and Lambert, 1958). 
The weakness of this approach is that by 
verbal scales to arrive at dimensions it tak 


tactics that may be importantly employed in extra-e i (pat 
the first place selected scales may include the trivial and irrelevant and exclude 


the essential, from the viewpoint of an individual's everyday behaviour. Nu 
experimenter rather strangely undertakes the onus of anticipating and was ing 
into his procedure much of the data which one might expect him to derive from 


the experiment. 

As a partial solution t 
by subjects in interpreting 
tion and selection by judges to arr 
Philipszoon, 1963). When these sca 
identified, one at least of which (natura 
condescension) did not correspond to those 
may be because they committed a ' category 
scales which probably would be taken to re 
than those relating to temporary emotional state. 


employing only certain pre-selected 
es no account of other cognitive 
xperimental situations. In 


о this problem a list of 300 terms, used spontaneously 
subjected to further categoriza- 


facial expressions, was su 2] e 
ive at a reduced list of 27 scales (Frijda and 
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Secondly, even if one were to elicit scales from individual subjects, in the 
manner of the repertory grid technique (Kelly, 1955), the possibility still 
remains that as verbal scales they may not adequately represent the responses 
available to a subject in extra-experimental situations; or, alternatively, that 
they invite the subject to respond in what may be a relatively unusual modality. 
It can also be objected that the verbal procedure leads to dimensions which 
explain not so much the underlying structure of facial expressions as of words 
which are or might be used to denote facial expressions. 

Multidimensional scaling has been used to avoid this difficulty (Abelson 
and Sermat, 1962), with the experimenters asking for similarity ratings on all 
78 possible pairs of a set of 13 photographs. Evidence emerged that two of 
Schlosberg’s three scales were sound; the attention-rejection scale proved 
redundant, and no additional meaningful dimensions were forthcoming. ‘The 
drawbacks in this investigation lie in the small, though in one sense representa- 
tive, sample of stimuli used (they were chosen as representative of Schlosberg’s 
original set of 48), and in the nature of the task involved. Even if a small set 
of stimuli are used, such multidimensional techniques require the subject to 
perform a task which at best tends to be artificial and tedious; with a large set, 
subjects need very high motivation indeed to perform. 

The present study was prompted by the wish to evolve a simple procedure 
for the judgement of facial expressions, which was both potentially non-verbal 
and also involved the subjects in a task which could be carried out quickly and 
spontaneously for a fairly large set of stimuli. The procedure should be as 
congruent as possible with everyday, extra-experimental situations; 
with the method of analysis used to derive results. 

On the assumption that a facial expression is generally perceived in the 
context of a series of preceding and succeeding expressions, it was hypothesized 

that the primary response to one or more expressions would be to construe 
their contextual (or functional) similarity to other examples (past or present). 
It would thus be a natural and simple task to require subjects to group a set of 
facial expressions according to observed similarity. And since cluster analysis 
offers a non-parametric, descriptive way of grouping functionally similar elements 
in a matrix of interrelationships, a congruent method of analysis is available for 
determining the structure of subjects’ judgements in this task. 


and also 


2. EXPERIMENT 


Subjects 30 female art-teacher students at B. 
second year of study. All the subjects w 
in the experiment was refused. 

Materials 
used, and a “ге 


ath Academy of Art, in their first or 
ere randomly selected; no invitation to participate 


The 30 photographs described by Frijda and Philipszoon (1963) were 
е grouping ° procedure was introduced. 


Procedure "Тһе complete set of photographs were shown to subjec 
random, open array, and Subjects were asked to group these 
facial expressions were included in the s 


ts individually, in a 
once in such a way that similar 
ame group. They were told that there was no 
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maximum or minimum limit either to the number or size of groups formed. It was 
emphasized that there was no right or wrong way of doing the grouping; the subject was 
simply told to group together those photographs which, in her own judgement, were most 
similar, in such a way that each facial expression was more like those others in its own group 
than it was like any other one in any other group. 

When a subject’s results had been tabulated, she was asked whether she was aware of 


having used verbal labels in the grouping activity; and was required to provide the most 
appropriate label for each group formed. 
The entire judgement session lasted approximately 20 minutes. 


Cluster Analysis of Data A variant (Cluster 3) of a general purpose clustering 
program described by Constantinescu (1966, 1967) was used. The method 
uses graphs and trees and their matrix representations, and describes the 
interrelations of a set of elements which may have a number of functionally 
common properties. 

In the present instance the inferred similarity frequencies for all possible 
pairs of the 30 stimuli were calculated. A matrix was derived by placing a 
count of 1 in the appropriate cell on each occasion that a subject grouped any 
one photograph with any other photograph. Cell frequencies could thus vary 
from 0 to 30, and were taken as the overall measure of judged similarity of all 


pairs of facial expressions. 

The distances between elements 
of elements were not associated together in the same group 
matrix M (‘Table 1). ‘Partially disjoint’ clusters of stimuli as grouped by 
subjects were derived from M by means of Cluster 3 (Table 2). The final 
pattern in Table 2 was produced by an Atlas computer in approximately five 


minutes. 


(that is, the frequencies with which pairs 
) were entered in the 


3. RESULTS 


Subjects produced variously from 4 to 13 groups (mean = 8-40), with from 
1 to 12 photographs in each group (mean = 3:57). ‘Almost all subjects reported 
that they found the task ‘easy’ and ‘natural’, although some felt that they 
could have elaborated and qualified their judgements given considerably more 


time. Just over half reported the conscious use of verbal labels while forming 


groups. 


The search for principal disjoint clusters (not only on a given level) revealed 


five: (A) Worry—(stimuli numbers) 6, 8, 9, 12 (level 18); (B) a pom 
7. 11, 17, 20, 28 (level 24); (C) Surprise—2, 3, 4, 6, 17, 18, 19, er р 
(D) Thoughtfulness—10, 14, 21, 23, 24, 26, 29; (E) Happiness—1, 5, n. 5 
16, 25, 27, 30 (level 28) The level number indicates the level at which all the 
detente sera absorbed in the cluster, and corresponds to the value of the largat 
nt distance in the cluster. It will be noted that clusters A and Сап 

y disjoint; also cluster D does not 1n fact occur at any 
t points will be amplified below. The verbal 
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B and C are not strictl 
level of the cluster solution. These 
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labels were arrived at by considering all the labels attached by subjects to any 
groups containing at least one photograph from a particular cluster. In other 
words, the 30 labels given to each photograph as a member of a group were 
“summed ’ to produce first an ‘ average ” label for that element (Table 3); then 
these labels were ‘ summed” for each cluster to produce a cluster-title. [here 
was considerable general agreement among subjects over the labels; most of those 


given to any single photograph fell into a small number (one, two or three) of 
clearly definable and usually related categories. 


TABLE 3. VERBAL LABELS GIVEN то Ғасілі. Expression STIMULI 


This table lists the main categories of verbal labels given to each stimulus, and is only a 
summary of actual labels used in many cases. Nuances referred to in the text are omitted 
for the sake of economy. 


STIMULUS VERBAL LABELS STIMULUS VERBAL LABELS 


1 happy, pleased; calm 16 happy, pleased, amused 
2 surprise 17 surprise; horror, pain; fear 
3 surprise 18 surprise 
4 surprise 19 surprise; anger, rebellion 
5 amused, happy 20 disgust; horror, pain; misery 
6 doubt, worry 21 pensive; stubborn, aloof; angry 
7 disgust; pain 22 surprise; horror; fear 
8 worry; sadness; questioning 23 thoughtful; disdainful 
9 worry; doubt 24 thoughtful, calm 
10 pensive; stubborn; questioning 25 happy; thoughtful 
11 disgust 26 thoughtful; stubborn, aloof 
12 worry; thoughtfulness; anger 27 happy, pleased 
13 happy, pleased 28 pain; sorrow; aversion 
14 pensive, dreamy 29 thoughtful 
15 sceptical; thoughtful; cheeky 30 happy, laughing. 


The nucleus of cluster A is made up of elements 6, 8, 9 which represent a 
puzzled, questioning, and sometimes surprised, worry. 12 seems to be a more 
concentrated expression of worry, verging towards anger. 

Cluster B has elements 7, 11, 20 in its main sub-cluster, which chiefly 
represents disgust (distaste, aversion, etc.) and pain. 17 and 28 have more of 
the ‘ pain’ component, * horror-pain ' and © sad-pain ’ respectively. Since 20 
has more equal parts of disgust and pain with some misery it serves to link 7 and 
11 with 17 and 28—the axis of an ‘ aversion ’ cluster running from distaste to 


6). It may be preferable to discou 
disjoint, Elements 2, 3, 4 are the 
surprise (astonishment, shock). 


nt it from this cluster, and treat A and C as 
° pure ' component of the cluster, representing 
17 and 19 Tepresent respectively horror and 
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fear, and anger, in addition to surprise. 22 is closely related to both 17 and 19 
(despite the lack of an ‘ anger ’ component among its labels), as well as to 2, 3, 4, 
and this acts as the axis of this cluster. Element 18 is more particularly akin 
to 2, 3, 4 than to the others. A cluster of 2, 3, 4, 18, 22 (level 19) is an inter- 
mediary cluster between the nuclear 2, 3, 4, (level 10) and the whole cluster. 
Cluster C runs from ‘ pure’ surprise to surprise tinged with horror and fear, or 
anger. 
Cluster D is slightly more complex. The nuclear cluster comprises 
elements 21, 23, 24, 26, 29 (occurring in level 26), which resolves into two higher- 
level clusters—21, 23, 26 (level 18) and 24, 26, 29 (with 14, level 20). The 
former seems to represent a positive thoughtfulness (overtones of disdain, 
defiance, anger) mixed with determination, while the latter is described as a 
calmer, more contemplative thoughtfulness, with a suggestion of worry and 
sorrow (particularly 29). Element 26 sufficiently combines both aspects. 14 
occurs in a cluster with 24, 26, 29, but is not linked with 21 and 23. It repre- 
sents a particular example of peaceful, dreaming thoughtfulness. 10 combines 
determination, stubbornness, and questioning, with thoughtfulness; and is 
accordingly most similar to element 21. Although in the present solution it is 
not absorbed into any of the lower-level clusters, as 14 is, examination of the 
similarity matrix will show that it fairly belongs to cluster D. While 10 and 
21 represent one side of this cluster, 14 and 24 represent the other. x 
Happiness, ranging from calm pleasure to hilarious amusement, is indicated 
by the labels given to elements in cluster E. The former is represented par- 
ticularly by 1 and 25, the latter by 5 and 30. The higher-level cluster 13, 16, 
27 (level 11) combines both components to form a link between these two pairs; 
it verges rather in the direction of the former, particularly in respect of the labels 
attributed. Element 15 (scepticism, thoughtfulness, cheekiness) is шы 
integrated with this group, although it seems to combine (оны ап 
joking; in this context it clearly is not of the same order as other elements in 


clusters D and E. 5 
Where there was a similarity frequency of 7 OF men k indication 
pair, that is between elements of different clusters, this was taken as an п 


i "he f ving w 2 9 with 28 
f inter- sr relationships. ‘The following were found: 8 and 
и жет - A and C); 8 with 29, 12 with 10 and 21 (A and D); 


(clusters A and B); 12 with 19( 
ith 22 | Of with 25, 23 with 15 (D and E). (The 
7, 11, 20 with 22 (B and C); 14, 24, 26 wi A eu merid ier us 


1171 sters А an 
presence of elements 6 and 17 in clusters A ar ва тере : 
been mentioned above.) Again from the evidence of the labels, it is possible 


i i * Disgust ' “surprise? are 
to infer a description of these relationships. re m is зона 7% 
linked by a component of ‘ horror’, and are respectively related to Бы Y 4 

д 4 š orry 
i d n ‘angry questioning 

way of ‘pain’ and ‘sadness’, and by a y tion t 
joined v thoughtfulness ' in respect of both a ‘ questioning worty and an 
“angry с t san or ‹ stubbornness ' (it should be remembered that there 
ы * Thoughtfulness ' and ‘ happiness 


are ‘anger? components in cluster. À). 
are жіне ШШЕ by shared elements of thoughtfulness and calmness. 


е for any extra-cluster 
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4. Discussion 


These results cannot strictly confirm or weaken those of other studies, 
which have generally used factor methods ofanalysis. Cluster analysis identifies 
clustered patterns of functionally similar variables, rather than measuring 
component influences. А single cluster may contain several components, and 
a single component may be represented in several clusters. Furthermore, the 
description of a cluster’s meaning in the present experiment must differ in 
kind from the interpretations given to factors; the former is educed from the way 
in which subjects labelled the groups they formed, while the latter is a logical 
consequence of the subject's use of pre-determined verbal scales. 

However, in view of the remarks already made on the structure of elements 

in the present experiment, it is possible to describe an approximate line running 
from hilarity and happiness through thoughtfulness, worry and anger, to 
surprise and fear, or pain and disgust, and ultimately perhaps to horror. This 
line clearly reflects the * pleasant-unpleasant ' dimension of previous experiments, 
as well as the opposition of control to intensity of expression. Controlled 
expressions (thoughtfulness) seem to occupy a rather more axial than polar 
position in the overall pattern. ‘ Sleep-tension ’ and ' attention-rejection ' (or 
disinterest) are not exactly represented. Instead there is an opposition between 
active and passive expressions, between the axial thoughtfulness, worry and anger, 
and other clusters. (Active expressions reflect thoughts or emotions directed 
at, rather than produced by, a feature of the inner 9r outer environment.) 
Frijda and Philipszoon’s (1963) unique factor of ‘ naturalness апа submission 
versus artificiality and condescension ' is perhaps present here as an elaboration 
of ‘passive versus active’; for ‘active’ elements labelled “thoughtfulness ’, 
‘worry’ and ‘anger’, and only these elements, were also labelled ‘aloof’, 
' disdainful', and sometimes were considered to be ‘ posed’, ‘social’, or 
' phoney ’. 


5. CONCLUSION 
It is possible, by the experimental procedure tested here, to obtain a detailed 
and intelligible description of the way subjects structure a set of facial expres- 
sions. Five main clusters with their internal and external linkages have been 
described. How stable these clusters are, and how many additional ones might 
be found ina larger set of stimuli, can only be determined by further experiments. 
Studies are in Progress to test the assumption that the primary response to 
facial expressions is in terms of their contextual similarity; and to compare the 


med cluster analysis with solutions derived by other methods from the same 
ata. 


Since one of the 
30 stimuli was 
that their criter} 
and in the trj 


criteria of Frijda and Philipszoon (1963) in choosing their 
Iepresentativeness, we may ask how far present results confirm 
100 Was met. [n so far as there is imbalance in the size of clusters 
angular structure of clusters A, B and C, it was not met. 
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i шы в within сима may also rule against the claim. The 
i ed here might be extended to the specific study of stimulus 
equivalence and representativeness. 
м d e be possible to study subject typology in respect of responses 
! тее grouping task, by determining the frequencies with which any 
pair of subjects produced the same inferred judgement of similarity. А matrix 
could be formed and cluster analysed. The number and size of groups they 
produced, as well as their differing ease in arriving at verbal labels, might be 
The labels themselves could be used in a semantic 


interesting variables. 
omparison. 


differential situation, judging the same stimuli, for methodological c 

For the future this study suggests the investigation of facial expression 
articularly simple, non-verbal tech- 
method to different material 
tion of cluster analysis as a 


judgements by more varied techniques, p 
niques; the extension of the ‘inferred similarity’ 
and experimental situations; and a wider applica 
descriptive instrument. 
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A METHOD FOR INVESTIGATING CHILDREN'S UNDERSTANDING 
OF CERTAIN LOGICAL CONNECTIVES USED IN 
BINARY PROPOSITIONAL THINKING 


By E. A. PEEL 


University of Birmingham 


eference and an experimental method are described 
for investigating children's understanding of such logical connectives as If... then 
in certain binary propositions. A game is played with coloured beads and 
counters between experimenter and child which both obviates the need to use 
© naturally ' connected binary propositions and at the same time prevents the 
rejection of the propositions on account of their arbitrariness. ‘The games 
consist essentially of a bead draw by the experimenter followed by a responding 
draw of a counter by the child, according to the instructions for each game. _ 
_ Тһе frequencies of different conjunctions of bead and counter draws (pq, РФ, 
Ра, Pq, where p and q аге different kinds of beads and counters respectively) are 


obtained and converted to proportions. ed with 


These proportions are compar 
those expected theoretically according e definitions of the main binary 
propositions. 


A theoretical frame of r 


to th 


1. INTRODUCTION 

There have been few previous studies of what young children understand 

by: If... then, If... then not, and Either... 07 (Matalon, 1962; Naess, 1962; 
Hill, 1961; and Suppes, 1965), although there have been more extensive studies 
of the syllogistic reasoning of children and adults (Mays, 1963; Hill, 1961; 
Suppes, 1965; and Wilkins, 1929). 


When investigating what you en understand by th deris 
nectives, a real difficulty is found in having to model the propositions 1 


Way as to minimize the dependence of the method on language and meaningful 
associations. Also, in the case of the first set of connectives, If. - - then, t Р | 
15 the well known tendency, revealed by most people, to require à қараға rl 
relation between the two propositions connected by If - - - Be. іа a 
accept such binary propositions as If Kate switches on the " ee rene 
brother goes out, but find it more difficult to accept Jf cats have ¢ д ane 
Universe is expanding. It seems that the logician's purely formal na ie iine 
usually to be encased in a natural implication in which the secon prop 

Сап be seen to depend naturally or C? 


| usally upon the first. This has led both 
9 ici B B x 
&icians and psychologists to distingu! 


sh between these two ideas of ia en 
с f i itrary situatio: 
iiim (1962) investigates them separately by first using an arbitrary Я 
f two different co 


f [һа Ils 
i i i ts for what he ca 
^ à the switching on O 


i various con- 
ng childr e various 


loured ligh ies 
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° material implication '—that is purely formal implication —and secondly by 
using the natural setting of a village, in which the post office can only be reached 
by passing a certain house, for investigating what he calls * natural implication ’. 
However, it is possible to bridge the gap and to produce arbitrary propositions 
which do not outrage the sense of reality of the thinker, but which are none the 
less not dependent for their acceptance upon meaningful associations between 
the constituent propositions. This bridge can be effected by a games technique. 
The essence of an organized game is that arbitrary rules and relations are 
accepted, as in the rule that the ace takes the queen, or that if the ball is knocked 
forward in a game of rugby football a scrum must be had. Such a technique 
has quite definite advantages when investigating young children, as they are 
quite prepared to accept arbitrary rules in a situation which appears to have 
the qualities of a game. 
There is also a further difficulty when such binary relations are investigated 

by ordinary propositions, as in the case of the first one mentioned above: 

If Kate switches on the television, then her brother goes out. 

Kate switches on the television. 

Does her brother go out? 


and that is that one has to take great care that a conclusion is not reached merely 
because it looks plausible. ‘Thus in her study of the syllogistic reasoning of 
students, Wilkins (1929) showed that if a conclusion is naturally plausible, it is 
often accepted even though it does not follow from the premisses given. 
Similarly, Suppes (1965) draws attention to the same difficulty in the work 
conducted by Hill (1961) where a badly constructed reasoning problem is one 
in which the conclusion appears naturally plausible, such as Boys run faster than 
girls, 

Formal implication, Jf . . . then, сап be dressed out as follows: (a) by two 
naturally and causally related propositions; (b) by two unconnected propositions, 
so put together as to bring about a rejection of the implication, such as If cats 
have wings, then 24+2=4; and (с) by two arbitrarily connected propositions, 1/ 
you pass the ball forward, then your side will be penalized, which are accepted by 
the thinker as valid rules for a game. 

This paper is concerned with a method which utilizes the third type, and 
so avoids the rejections aroused in (0) and is free of the assumptions of natural 
causes implied in (а). Furthermore, by not calling for judgements of written 
or spoken propositions, it avoids a situation where a conclusion may be affirmed 
or denied because it is plausible or otherwise. By the game technique, it is 
hoped to make the binary relations acceptable to the child, and at the same time 
to enable us to concentrate on a binary relation independently of any natural 
relation between the two propositions involved. This approach, of course, 4068 
not deny that a child's understanding of If... then has been acquired as a result 
of natural usage within the family and with other children. Children are often 
subjected to popular meanings of implication, such as If you eat your pudding, 


^k 
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ГІ give y : 

и. ра а кан which do not at all carry the meaning assumed by the 
isi nil м t » o purpose of this method to investigate what the child 
ктүн ы -- then, If... : then not, and Either... or, but it is better 
umi wim унн чени not tied by previously learned, acceptable relations, 
Wilkins (95) жаз as are the methods of Hill (1961), Suppes (1965) and 
oon English РОСЕН А of complex propositions in the form of 


" 2. Tue GAME PROCEDURE 

Р... Apr Е ore? d ei Lu ает implication, incompatibility and 
individual ks pn voles ome р ne | canem the experimenter and each 
in a tin, there being at least 10 each T four acm D Снр хук 
green. In another shallow tin is ¢ imil veg E pom a 
<> ele жы is a similar collection of counters, also made up 
ни: І сас of the same colours. The experimenter and the subject face 
b h other with their beads and counters before each respectively, and an empty 

ox is placed on the table between them into which they make their play. The 
Se e Consists essentially of the experimenter drawing a certain bead, and then 
TE Sabjeet drawing a counter depending upon the rule of the particular game. 

he rules for each of the three games are briefly as follows: 
4 ©“ We may each pick апу colour we like, but in this game, 
if and whenever I draw a red bead, you also draw a red counter.” 


* We may each draw any colour we like, but in this 
a red bead, you are not to draw a red 


1. Implication: 


2. Incompatibility: 
game, if and whenever I draw 


counter.” 
3. Disjunction: “ We both draw so t 
between us. I shall draw first, an. 
a counter and put that in." 
b ed after each draw 
eads put back in their respective boxes. 
each child in a set random order consisting of six red 


10-bead draws are initial draws made by the experim' 
has to make the response by drawing an appropriate counter. The child's 


draws are recorded in relation to the initial draws made by the experimenter and 
so a complete record of all possible conjunctions of bead and counter draws is 


kept. 
Тһе game is also played in the 7e 
"vain an initial bead draw and the experi 
raw. The child is then asked to say whether the expe 
Ог wrong. Ву this technique the investigator is able to concen 


о 4 . * . . x 
Ccurring conjunctions made by the child in the binary games 0 


ast one red in the box 


hat there is at le 
n, and then you draw 


d put my bead i 


in each game and the counters and 

Теп pairs of draws are made with 
s and four non-reds. The 
enter, to which the child 


The box is empti 


hich the child is asked to 
kes a consequent counter 
rimenter's draw is right 
trate on the rarer 
f implication, 


verse form іп W 
menter then ma 


in УЙ; = : 
. incompatibility and disjunction. 
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3. PSYCHOLOGICAL REQUIREMENTS RELATING TO MOTIVATION 
AND REINFORCEMENT 


First, as to motivation, the testing was conducted in an informal 
atmosphere. The games aspect was heavily stressed in order to keep up 
the subject’s interest and enthusiasm. Children of the ages of 5 to 10 took a 
great interest in the competitive aspects of the game, that is, in the competition 
between each child and the experimenter. It was found that very little needed 
to be said to maintain this interest for such a short run as 10 draws. 

Such reinforcement as was given was not to obtain certain responses, but 
merely to ensure an act of responding (up to 10 draws). It was therefore impor- 
tant that the reinforcement should be general, pleasant and neutrally uninforma- 
tive regarding the draw made by the child. It was found quite easy to do this 
by a pleasant nod of approval, or an utterance of “ Yes" or “ Fine” and so on. 
Such a reinforcing situation would not have endured over a long run of draws, 
but it was found that 10 draws by no means strained the child when no informa- 
tion was given about the correctness or incorrectness of his response. "Гһе 
purpose of the 10 trials was to obtain a more complete picture of the child's 
responses in order to obtain as many as possible of the binary conjunctions 
involved, so that an adequate analysis of the group of children as a whole could 
be undertaken. 

Some 150 infant and primary school children played the games and they 
ranged in age class intervals from 5+ to 11+ with frequencies in each year 
class ranging from 13 to 30. Since each child responded by making 10 draws, 
there were, in all, some 1,500 conjunctions to analyse. 


4. THEORETICAL Basis OF THE Games 

The operative colour used in the draws was red. Let us write R for a red 
bead draw and R for a non-red bead draw, made by the experimenter. Let 
us also write ғ for a red counter draw made in response by the child and F for a 
non-red counter draw. Non-red means blue, yellow or green. "lhere are, 
then, four possible conjunctions of bead and counter draws, Rr, R?, Rr and ЖР, 
each representing a game between experimenter and child. Clearly, the total 
frequency of occurrence of such conjunctions for any group of children can be 
counted. Thus for the group of fifteen 5 to 6 year-olds, each of whom played 
10 games, the total frequency would Бе / +/+, ./р-150. Furthermore, 
we may divide these frequencies Чр according to whether they are based on 
initial R and R draws as follows. There would be 90 initial R draws and 60 Ë 
draws by the experimenter. Hence f, "E ‘4 f_ = 60. e ma 
reduce these frequencies to Meist, ain. ex m ee Mes 
and so on. uiis i 

Тһе three binary propositions of implication, incompatibility and disjunction 
will be considered in terms of these Proportions. The three propositions will 
be defined in the manner of the logician, namely: 


vj 
T 
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Implication Кэт 
by RrvRrvRi, — Ri being false; 

Incompatibility RDF 
y by Riv Rr v Rr, Rr being false; 

Disjunction Rvr 


by RrvRiv Rr, Rr being false; 

and use these as hypotheses to compare with the incidence of binary conjunctions, 
Rr, Rr, Rr, Rr, actually arising in the course of the games. 

"'aking first implication, If... then, as logically defined, it would be expected 
that all the children’s counter draws to R bead draws would be r, with no 7 
draws (RF being false) and that there would be an equal distribution of r and 7 
counter draws to initial R draws. 
_ The above binary conjunctions, 
in Table 1: 


in terms of the forward games, are shown 


TABLE 1. IMPLICATION: PROPORTIONS OF ConJUNCTIONS 


Initial bead draw by - 
experimenter R R 
Responding counter draw 
by child r y r ú 
Binary conjunction Rr Rr Rr Rr 
Truth value according to the 
true false true true 


logical meaning of implication 


Theoretically expected 
proportion of each 
conjunction according to the 
8 1 0:5 0-5 


above truth values zero 


d proportions for the remaining games of 
2 


btained and are shown in Table 2. 
rtions expected for the 


since these two 


Similarly, the theoretically expecte 


incompatibility and disjunction may be o 
It is worth-while also to set out the theoretical proport? 
cal equicalence and mutual exclusion, 


the results obtained (Table 3... 
he form of obtaining the observed 


p Rr, Rr and RF and 


Proportion of occurrence of the four 


Comparing these wi soretical propor 
se with the above theoretical p p Be . . 
The theoretical proportions of the Rr, Ri, Rr and RF conjunctions may also 


2 ^ 5 фа an initia w 
p computed for the reverse game. In this game there is an initial Ln pon 
y the child, a responding counter draw by the experimenter and à 


1 r aj sarrect 5 таба ҰР З Іп 
acceptance by the child of this counter draw as being correct or incorrect Ж. 
ber of acceptances and non-acceptances by the cnt 


the 
reverse game the num 
ns are based on these data. 


are i 
counted and the proportio 
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TABLE 2. INCOMPATIBILITY AND DISJUNCTION: PROPORTIONS OF CONJUNCTIONS 


Initial bead draw by 
experimenter R R 


Experimenter [child binary 


conjunction Rr Rř Rr Rř 
INCOMPATIBILITY false true true true 
Theoretically expected 

proportions zero 1 0-5 0-5 
DISJUNCTION true true truc false 
Theoretically expected 

Proportions 0-5 0-5 1 zero 

TABLE 3. 


EQUIVALENCE AND Метил. ExcLustoN: PROPORTIONS OF CONJUNCTIONS 


Initial bead draw by 


experimenter R R 
Experimenter/child binary = 2% 

conjunction Rr Rr Rr Rr 
EQUIVALENCE (Rr v Rr) true false false true 
Theoretically expected 

proportions 1 zero zero 1 
MUTUAt exciusion (RF у Rr) false true true false 
Theoretically expected 


proportions — 


f acceptances and non- 
rding to the five binary propositions required 
агі game. In this case the proportions аге 
of instances of each b 


acceptances can be tabulated ассо 
to analyse the results of the forw 
based in each case on the number 
separately (Table 4). 

Lastly, the matching of поп-г 
the tendency, when the experi 


inary conjunction taken 


Proportions expected by chance. A count may be 
made of the total frequencies of matched Rr draws (bluc-blue, yellow-yellow, 


green-green) fm and of non-matched RF draws (blue-green, yellow-green, blue 


yellow, etc.) f= for each of the three logical games, These frequencics can be 
converted to the proportions: 


fm f- 
Тір Pm and ja 
Int x ?» faf P. 


m 
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It is evi i non. 

evident that by chance, with three non-red colours it i p 
т-0:33 and bz 0-67 f 1 е 5, It 15 ех ected that 
bm = = 0-07 tor all three logical games. These proportions are 


used a i i 
s a base line to examine the tendency for children to match non-reds 


TABLE 4 
. "THEORETICAL PROPORTIONS OF THE ACCEPTABILITY AND NON-ACCEPTABILITY 


OF THE CONJUNCTIONS 


Initial bead draw by child R R 
Conjunction of the above 
bead draw with the Rr RF Rr Rr 
responding counter draw 
made by the experimenter 
Decision of the child 
about the experimenter's acc. non- acc. non. acc. non- acc. поп- 
draw of a counter acc. acc. acc. С асс. 
Theoretically expected 
proportions 
Implication 1 тего zero 1 1 zero 17 WE 
fni сот, patibility zero 1 1 zero 1 zero zero 
Disjunction 1 zero 1 zero 1 zero zr 1 
Equivalence 1 zero zero 1 zero 1 1. = 
Mutual exclusion zero 1 1 299 i pes š т 


Метнор ON SOME RESULTS 


5, ILLUSTRATION OF THE 
method, some results 


ate the working of the 
ps are now presented. 
һе bead-counter C 


for children 


Ia order to illustr: 
of different chronological age grou 

Table 5 shows the frequencies oft 
forward game of implication. 


onjunctions in the 


TABLE 5. IMPLICATION GAME: FREQUENCIES OF Brap-COUNTER ConJUNCTIONS 


Initial bead draw R R 

Bead-counter conjunctions Rr Rr Rr Rr 

Sample 
Size 
ЭЗЕ 15 83 6 3 57 
Age group 8+ 13 77 1 3 R 
11+ 30 179 1 4 11 

These frequencies give rise to the proportions shown in Table б, А 
the children reacted to the game instruction 
e in this 


When they Wer 


From Table 6 it seems that 
the antecedent 


1 

4: d then as if it meant If and 

Als they were called upon to produce thi 
oice made by the experimenter (see Discussion). 


only wher. - - then. 
he consequence to 
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TABLE б. IMPLICATION GAME: PROPORTIONS OF BEAD-COUNTER CONJUNCTIONS 
Bead-counter conjunctions Rr Rr Rr Rr 


Theoretically expected proportions 


based on 
(a) Implication 1 zero 0:5 0:5 
(b) Equivalence 1 zero zero 1 
( Se 0:92 0:07 0:05 0:95 
Age group 8+ 0:99 0:01 0:02 0:98 
1+ 0:99 0:01 0:03 0:97 


For the reverse game of implication, played with the same children, the 


proportions of accepted and non-accepted bead-counter conjunctions shown in 
Table 7 were obtained. 


Таві 7. PROPORTIONS оғ ACCEPTED AND NON-ACCEPTED BEAD- 


COUNTER CONJUNCTIONS 
IN THE REVERSE GAME OF IMPLICATION 


Initial bead draw R R 
Bead-counter conjunctions Rr Rř Rr RF 
Judgement of child acc. поп- асс. поп- асс. non- асс. поп- 
асс. асс. асс. асс 
Theoretically expected 
proportions 
Implication 1 Zero zero 1 1 zero 1 zero 
Equivalence 1 Zero zero 1 Zero 1 1 zero 
5+ 1:00 zero 0-03 0:97 0-24 0-76 0-93 0:07 
Age group 8+ 1:00 zero 0:01 0-99 0:58 042 1.00 zero 
11+ 100 zero zero 1-00 0:82 048 0-98 0-02 


Apparently, here we see an effect, increasing by agc, of the 'repetition' by 
the experimenter of r counter draws to the child's initial R bead draws. T'he 
conjunction A has a key role in transforming an Jf and only when... then binary 
proposition (equivalence) to one of If... then (implication). 

In order to illustrate the game of incompatibility, the results of the reverse 
game are shown in Table 8. 


Here it is seen that even the young child accepts incompatibility as defined, 
and this finding is also clearly shown in the forward game. Apparently, although 
he has difficulty in accepting Rr in the If.. then situation, he has no difficulty 
at all in accepting the analogous RF in the If... then not fiib. 


Lastly, the results of the method in the forward disjunction game are 
shown in Table 9, 


Here it can be seen how difficult it w 


: à as for the child to cope with the «f 
least one red instruction, At 54 the best 


logical approximation is that of the 
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TaBLE 8. REVERSE GAME OF INCOMPATIBILITY 


zm 


Initial bead draw R 


Bead-counter conjunctions 


non- acc. non- 


Judgement of child acc. поп- асс. non- acc. 
acc. acc. acc. i 
Theoretically expected = Е 
proportions for logical zero 1 1 zero 1 zero 1 zero 
incompatibility 
5+ 0234 0:66 0:84 016 0:76 024 087 0-13 
“Аде group 8+ 007 093 1:00 zero 0-86 014 096 004 
11+ zero 1:00 1:00 zero 0-96 004 096 0:04 
'lAgLE 9. FORWARD Game оғ DISJUNCTION 
Initial bead draw R R 
Bead-counter draw conjunctions Rr Rr Rr Rr 
Theoretically expected proportions 2 
according to logical disjunction 0:5 05 1 zero 
Theoretically expected proportions 
according to logical tautology = lack " 
of any system of counter responding 0:5 05 05 05 
Theoretically expected proportions 
according to logical mutual exclusion zero 1 1 zero 
— ш ош 00% 
Age gruop 8+ 0:54 0-46 = " 
11+ 0:08 0-92 0:89 041 


disjunction, not of and r, but of R and F! At 8+ the results sp rcs d 
to the tautology which in this situation surely reflects a complete lack of a system 


of counter drawing by the children. At 11+ disjunction 15 soe nee 
inchisive domn (Білер. 6 +" or both) but only as mutua ex 


(Either 

алық); ; T 
. method in show! 

Lastly, we will demonstrat ue of the me anger children but 

an instruction concerning reds is not taken реш h y əз of comparing the 

is м ee š i the results І 

8 applied also to the other colours. 4 counters is 


Matching and non-matching of blue, yellow а А pe 
given in Table 10 for the games of implication and incompatibility. 


Here is exhibited the matching spread of the implication rule in Фа Барлы 
younger children, from the reds to other colours, and to a slighter oe 
the spread of the non-matching of incompatibility. By the age 0 


e the val ng how far 


nd green beads an 
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TABLE 10. PROPORTIONS OF MATCHING AND NON-MATCHING OF NON-RED CONJUNCTIONS 
Rr Bv CHRONOLOGICAL AGES 


Implication Incompatibility 
m m m m 
Expected proportions by chance 0:33 0-67 0:33 0:67 
54 0-91 0-09 0-21 0-79 
Age group 6+ 0:59 0-41 0:24 0:76 
8+ 0:38 0:62 0:18 0:82 
11+ 0-38 0-62 0-34 0-66 


spread of the rule seems to h 
expected by chance. 


The above tables of results аге given merely to illustrate the wi 
theoretical analysis and the sensitivity and 
experimentation. The full disc 


logical and mental 


ауе disappeared, for the proportions are those 


orking of the 


effectiveness of the method of 
ussion of all the 


p a method and theoretical 
framework for Investigating children’s understanding of logical connectives used 
in binary propositions. We are not here concerned Primarily with the results 


comment on the metho 


theory, some observations will be 

made on aspects of the results which have some methodologi 
Taking first the mplication game link 

nd that implication is 
between the children's 
compatibility (which is read by them as the 
new evidence, Disjunction caused most 
sed in the form: ‘There must be at least 
а complete loss апа the oldest children took 


ust be only one req? They read disjunction 
lusion, 


-cor game technique enable all the logical aspects 
of binary propositional thinking to be probed? 


Both Matalon (1962) and Hill 
(1961) were able to test the modus tollendo tollens logical situation, The form of 


ama —————— аа 
E . 
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volved the following arguments, Matalon using 


their binary proposition tests in 
ly, the modus ponendo ponens and tollendo tollens: 


all four and Hill the extremes on 


рэ q $24 $24 9:4 
E лап a. ИР ios mo 
.. What? (g) 22 What? (q v 9) .— What? (p v P) =. What? (р) 
indeterminate indeterminate modus tollendo 


modus ponendo 
ponens 


tollens 

e to test the modus tollendo tollens inference in the bead- 
counter game since it would have involved a non-red counter being drawn and 
a subsequent question being put to the child as to what kind of bead draw would 
have had to precede the counter draw. This would be calling for a certain 
abstracted quality of inference which it was not desired to invoke. 

However, it would be quite possible to test the modus tollens inference of 
implication by having the child observe a bead-counter game between two other 
people and allowing him to see only what coloured counter the second player 
drew, and then finally asking him to say what the first player had drawn. 

The disjunction inference, modus tollendo ponens, schematized as follows: 


рма 


It was not possibl 


Е: БЕБЕ 
2 What? (а) 


used by Hill (1961) was tested and found, as by her, to give no difficulty. A 
inate disjunctive argument: 


much more crucial test is the indeterm 


b vq 


? 

What? (4 or Ф 
this showed an almost complete lac 
e is taken to mean 077€ onlv. 
game, as used to 
If it were desired t 
tollendo tollens, 
and then be as 


As already mentioned, к of 4 responses, 
indicating that at least on 
In general, the bead-counter 
inferences from the antecedent. 
from the consequence, as in the modus 
observe a game between two other people 
concerning the initial bead draw. 
We now turn to à fiducial question. The results of the games are compared 
with certain theoretical models based on propositional logic. Essentially 
d with those expected 


obtained frequencies or proportions are being compare’ . : 
according to the logical definitions of the binary propositions. Thus in Table 5 
for the 5+ group of children are being compared: 


date, enables one to test 


o investigate inferences 
then the child could 
ked for his inference 


the following frequencies 
Conjunction Rr Rr 
Theoretically expected 
89 nil 


frequencies 


Obtained frequencies 
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What confidence, statistically, have we that 83 can be regarded as an approxi- 
mation to 89? It might appear to be a straight goodness of fit problem, com- 
paring 83 and 6 against 89 and nil, but the nil entry rules out this approach. 
There is a further complication that each child makes 6 responses to R draws 
(and 4 to R draws). Should these 6 conjunctions be regarded as correlated? 
(An alternative method might be to allow each child only two responses, one toa 
first R draw and another to a first R draw. But even if this is done, the first 
difficulty of making a statistical comparison appears to remain.) 

In this paper we have relied simply on the examination of Proportions, as 
in Table 9 on the disjunction results, to arrive at a comparison and at some 
identification of the children’s responses. 

To sum up, the method enables one to 
conjunction (Rr, Вғ, Rr or Rr) w 
when a child is asked to int 


pin-point precisely the particular 
hich are producing the general effects observed 
erpret certain binary connectives. By the use of 


я eal advantages as а Psychological technique in 
holding the interest of the children and in rendering binary propositions 
acceptable to them, 


Lastly, the method 
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А METHOD OF CLUSTER ANALYSIS 


By PauL CONSTANTINESCU 


University of Bucharest 


BL jas is proposed for clustering a set of elements with 
koi isi 2 cute Input information 18 described in terms of a graph 
ан med ires function between the elements. Definitions аге given 
a Serius E Не y iind a tree-representation ^ [tisshown that clusters 
uos : sin s o subsets (level-subsets) ina tree-representation ; and that 

al pattern contains at least one non-trivial tree, under specified conditions. 
An example of clustering is set out, and the computation of level-subsets is 


described. ‘The selection of clusters from à final pattern is briefly discussed. 


1. INTRODUCTION 


ssociated with the class 
penes: | е һесп considered for a long time, i 
ben era of high-speed computers (Ball, 1965) that practica 
: e available. The increasing quantities of data that have 
E „к fields of science demand methods for clustering the elements involved 

er to simplify the analysis to a point where only the * representatives ' of the 
clusters have to be studied. This process is already a classic one in mathematics 
where clustering takes place for sets endowed with different algebraical or 
topological structures. Empirical problems of taxonomy generally have to be 
examined with sets which do not have such properties; the criteria, in respect 
of which the classification has to be made, yield a distance or similarity function 
defined for the set of all pairs of elements. ‘The starting point for most clustering 
problems may be considered to be a matrix Ma which describes the distances 
(or similarities), whose order is the number of elements to be clustered and 
Whose entries are real numbers associated with the pairs 0 


ification of a set of elements 
t is only recently 
1 solutions have 
to be analysed 


Although the problems a 


a s ii 
according to some criteria hav 


f elements. 


2. Tur CLUSTER 
ways of approaching a mathematical 
One of these conceptions of a cluster 
and biology will now be described. 
imed that the numbers associated 

Ма) are non-negative integers 
„Су ОЁ radius 1,2259 N are 
E, of the set E of 


a From the above viewpoint, two 
efinition of a cluster suggest themselves. 
which is relevant to applications in psychology 
with Es the sake of simplicity it will be assu 
0,1 А, е pairs of elements (the entries in 
нерге сауы If the semi-circles Ср 

sented on the half-plane, then a su 


v 


bset Ep Ba++ 
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m elements will be in C; (and not in C; 1), if and only if all the q(g— 1/2 pairs 
which can be formed with these 4 elements are associated with entries in Ma 


which are less than or equal to k, and if for at least one pair the entry is & 


(Figure 1). 
FIGURE 1. "The representation of subsets within semi-circles on the half-plane. 
By Biss ss Еу a subset of the set E of m elements, 
Cy, Cs. 


-» Cy: semi-circles of radius 1, 2,... Non the half-plane. 


A system of subsets of E із chosen whic 

(a) the set Е which is in Cy belongs to t 

(6) only disjoint subsets whic 
in the system; 

(c) if the subsets S and S* are in C, a “ете і re j 
then either S C 5% or SU Stag p and C, respectively, where p< 4% 


The subsets of the system s 
© clusters 7. 


А second approach, which see 
] ms to be us ical 
problems (for instance, the clustering of i EL fo same aep 


h fulfils the following conditions: 
he system; 


h are in С, (ғ-0,1,.., N) may be considered 


ubject to conditions (а), (b) and (c) will be called 


OWN 


t 


А 
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(1965); the union of the subsets is the set Е, but according to (c) the subsets 


of the system are not disjoint. 

| А formal mathematical scheme to solve the clustering problem in this 

sense, already briefly described by Constantinescu (1966), will be presented; and 

its relation to the problem solved by Regnier (1965) will be briefly examined, 

шт programs in Algol!, embodying different variants of the algorithm, 
ave been written and an application of the clustering method in psychology 


is described by Stringer (1967). 


3. Tur INPUT INFORMATION 


„ Em} be a set of elements, and P= (Py Px .. Р») 


Let E= (Ey, En- 
der the function. 


a set of criteria. The criteria engen 
f: EopixPaX-- E 

., n) is the set of values associated with the criteria РІ. 

d by a matrix M, whose row vectors 


pondence with Ej, Eoo Em: 
on problem is a binary 
er the 


where pi (1-1, Bh os 

The function f can be describe 
„ ey are іп one-to-one corres 
are sets (0, 1} the classificati 
in a Euclidean space of dimension л ov 


el, €g.. 
For instance, if all the pr 
one for which the row vectors are 
field {0, 1}. 
Since the element 


associated with the elements of E, 
d : E x E—1, 


s of the pı are real numbers a distance function can be 


be considered as non- 
4 а constant to 
lar adjustments 
ej) and 


‘The values of d can always 
1 sometimes be necessary to ad 
by a constant; or make simi 
where dij = еъ 


where I= (0, 1, 2,.. NJ- 
negative integers, although it wil 
the entries in M, or multiply them 
to the symmetrical distance matrix Ma= (dij), 
4-0 (і-1,2,..., m). 


The function 4 сап also be described by а graph G (Figure 2), as follows. 


With every element of I a level is associated. On level 0 the elements of E are 
Considered as vertices. On level k, vertices аге formed by drawing lines from 
level 0 for pairs (Zi, Е)) for which di; =R. The vertices аге marked with the 
appropriate (Ei, Ej). This graph will be called the ' graph of pairs’. 

If the lines are not drawn, the pattern G' so obtained from G will be called 
the © representation G” of the graph G, or the ' representation of the pairs . 

In С (or G’) a vertex сап be considered, marked by the subset Еу Ер TT 
on level k (0 & k « N), if digit SR for all s t=1,2,. . 50 апа АҒ еге аге 
s',tte{1, 2,..., а) such that бар (Figure 3). The method for constructing 


such subsets (or vertices) will be given below. ts will be called 


€ 
level-subsets x 
1 Programs are available at the SRC Atlas Computer Laboratory, and at the i iss 


с 
entre of the University of Bucharest. 


These subse 
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The vertices Ej Е,,..., Ед on level 0 and the pairs of elements are 
articular level-subsets. | 
‚ The pattern (Figure 3) in which the level-subsets {E}, Eio, ә Ей) are 
displayed with respect to the distance function d will be called the ‘final pattern 
with respect to d. Different systems of clusters can be selected from it, fulfilling 
conditions (a), (b) and (c) of Section 1. | | 
In brief, the input information for the clustering problem comprises the 


matrix Mg, or the graph G, or the representation С” of the graph С. Each 
of these contains the same amount of information. 


Ë E, Е; Ej Em 
LEVEL (0) --e—-e-- 


LEVER (ае 
LEVEL (ijo — ess 


LEVEL (9) --- = -– -- 
(Ei, Ej) 
ШЕУЕЦН-)-------------------.. 
LEVEL (Н) -——— ————— аа 
Ficure 2. 
Ey Ey... 
Ei, Ej: 


The graph and the representation of pairs 


„ Em: elements of the set E considered as vertices. 
pairs linked at the appropriate distance level. 


E, Ei, Ei; Eig Em 


(0) ЕР T E 


Ficure 3. А level-subset marked by a vertex in a graph. 
Bassus s Em: elements of the set Е. 
Ж, Ea... E, 


UE level-subsets or vertices. 


4. DUALITY IN CLUSTERING PROBLEMS 


Having defined the function d, a new function s : E x E-J can be defined by 


slei ej)=s;=N-4q 
The function s is the similarity of the vectors e; е). The definitions with 
respect to s of the matrix M,, the graph G, the representation С” and the fina 


ij- 


i 
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pattern follow as f imilarity i i 
Alaa asst ог d. Similarity and distance are dual in the sense of the 


Lemma 1 


If (Ej, Ез... Eq} is a subset on level & in the final pattern with respect 


1 { 51 ; E isa subset on level 4 ki 
to d, then E ; Бы “жар 5 1 1 ren wi 
et aa | 2: ; а) ч N in the final pattern with 
Proof 

$^ Eq} is on level k, then all the 4(4-1)/2 pairs 


If the subset (Ej, E» - 
are on the first k levels in G: that is: 
dij <k for all i, j for which En Ej; Є{Е E» ++» Eq}; 
-dy> —h for all i j for which Ei, Eje {Ev E», ++» Ea); 
N-dgz N — R for all z, j for which £i, E; e{Ey Ez.. Eq}; 


sz N —k for all i, j for which Ei, Eje {Ev Ey «++» Ег}; 
Hence there are 


and Dues are i,j'e(b 2 ++ 5 а) such that dij =R. 
LÜje(1,2,... qj such that ssp = N-h then Ey Ез». Eq is a subset on level 
For the reciprocal situation the 


7 d ; 
№ А on the final pattern with respect to 5. 


proof is similar. 
On the basi 
respect to either d or s, and 


algorithms can be elaborated with 


s of this lemma, clustering 
duality in the other case. 


the results used by 


5. Tue TREE AND THE TREE REPRESENTATION 


Definitions 
(I) The level p is inferior to the level q if p> 4 
(II) The vertices on the level 0 will be called 
vertices on the last level (N) will be called 

, ап) * Absorption ' is the operation by level-subsets and 
pairs of elements, another level-subset is obtained which is equal to their union 
ze placed on the most inferior of the levels on W i г level-subsets 
eub placed. This operation is permissible if and o pairs and level- 
e sets which make up the new level-subset, all the pair 

ith the elements of their union are present. 
fi Any of the pairs and level-subsets which mak 
is ay or may not be included in the new pattern in W 
1 included. ‘Those which are not repres 
evel-subset union. 
dig; Тһе final pattern in wh 
E systems of clusters 
ab ion 1, is obtained from the graph of 

Sorptions. 
th By absorptions a final vertex re 

€ last level. 

в 


‘ initial vertices’ and the 


‹ final vertices ^s 


subset union 
subset union 
bed by the 


e up the level- 
hich the level- 


ented are said to be absor 


are represented, and from which 
g to (а), (b) and (о) of 


ich the level-subsets 
ion of pairs) by 


can be selected accordin: 
pairs (representat 
presenting the whole set E is obtained on 


S.P. 
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(IV) A graph G, in which vertices marked by level subsets are displayed, 
will be called a ‘ tree’ if 


(i) it has only one final vertex, and 


(ii) one and only one branch goes out to the inferior levels from every 
vertex, including the initial vertices. 
According to the definitions of the graph of pairs and of absorption, а 
branch will be drawn between two vertices S and S*, if and only if S с5* and 
there is no vertex on an intermediate level representing a subset T such that 


ScT. 
Different trees can be obtained from the graph of pairs by absorptions. 
(V) А representation С” in which level-subsets are represented will be 
called a ' tree-representation ’, if 
(i) it has only one final level-subset, 


(ii) on each level the level-subsets are disjoint, and 


(ш) a level-subset 5 on level s(s#0) contains at least one element of the 
subset R on level z < s, such that RCS. 


Different tree-representations can, by 


: г absorption, be obtained from the 
representation of pairs. 


Lemma 2 


The level-subsets in a tree-re 
any elements on level 0) are clust 


Proof 


(1). The level-subsets in a tree-represent 
Section 1 follows from V (i). 
last level) it must be the whol 


presentation (including pairs of elements or 
ers, and conversely. 


ation are clusters. Proposition (а) of 
Since there is only one final level-subset (on the 
ч e set E. Hence the set E belongs to the system. 
From V (ii) follows (b) of Section 1. From V (iii) it follows that for the 
level-subsets S and S* in C, and C, respectively (P <4), there are only tw? 
possibilities: i 
Sc S* 
SN S*¥=¢, 

(2). The clusters are level-subsets in a tree-representation 

From (а) follows V (1). If there were another fi 7 i ld be 
absorbed by the final level-subset E. al 


From (b) follows V (ii). On each level the level-subsets are disjoint. 


Erori. 9 | тү 62. f the level-subsets S and R on level s and rre) 
respectively are not disjoint—that is, S conta; f the 
subset R—then Ac S. ains at least one element 0 


^. 
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THEOREM 1 
G'i Ө Те арады 
Е. е ey representation if and only ifGisatree. By erasing the branches 
се С, a tree-representation С” is obtained; by drawi 
1 а rawing th i 
a tree-representation С” a tree С is obtained. " ОТЕ МАР 


Proof 

€ Шы no By erasing the branches a system of level-subsets with 
wd за (i) (ii) and V (iti) is obtained. Clearly V (i) follows, namely 
s only one final level-subset. 
. It will now be shown by negation t 

disjoint. mS 
Let there be two level-subsets S and 5% 
ue ы ! Let E:N. There is а subset M on level p«r such that 
mm cNcs (finally M= {Ei} will be on level 0), and also a subset M* on 
2 such that Eje M* c N cS*. If M-M* (in this case р= 4), two 
тұта hes go out from the same vertex to S and S*. This contradicts V (ii). 
M М* the process is continued using M as S and M* as 5%; finally stopping 
when p=q=0 and M=M*= {Ei}. Hence V (ii) follows. 
" I'he argument by negation also shows that V (iii) follows. If E;€R and 
n and Ас S, then a branch cannot be drawn from vertex R to vertex S. 

his contradicts IV (ii). 

(2, Let С” bea trce-representation. 
is йе ыш V (ii) and V (iii) it wil 
rawn to S* on level s (where r < $ 
pem r<p<s) such that SC TC 5%, 
he level-subsets on inferior levels are, 
E of them include S. At least one inc 
evel Let all the subsets which include 5 be 
51 оп level à 
5, оп level zo 


hat the level-subsets on each level are 


on level 7, such that N=SnNS* 


V (i) implies IV (i To prove that IV (ii) 
| be recalled that a branch from 5 on level 7 
), if SCS* and there is no T on level p 
Let S be a level-subset on level r. All 
according to V (iii), disjoint with S, or 
ludes S (for instance, E on the last 


S, on level it, 


for all k for which 1<R <t. 


where ғ<4<4,...і. Sc S; 
By V (iii): 
SC S1; SC Ss; аа Siac Se 
(B Since Sc S,C Sy... С St, one and only one branch goes out 
Lond gei levels ғ and /, there is no intermediate level on which 
that Sc 8%) Hence IV (ii) follows. 
Corollary 
Тһе vertices іп a tree are clusters, 


from Š to Sy. 
there is an S" 


and conversely. 
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6. THE EXISTENCE THEOREM 


A tree (tree-representation) will be called trivial if it has i boul 
vertices (m is the number of elements in the set E), namely one final vertex 
m initial vertices (Figure 4). 


(Ep Ez, E3——- En LES) 


Ficure 4. Тһе trivial tree-representation. 


Ey Ej, ..., Е,: elements of the set m. 


THEOREM 2 


А final pattern contains a 
only if there аге А level- 


(i) 1«k«m, 


қ г if and 
t least one non-trivial tree-representation if an 
subsets with the properties: 


(ii) at least one level-subset is not on the last level, and 
(iii) the А level-subsets represent a partition for Ж. 


Proof 


(1). Let there be a final pattern in which ther 
properties (i), (ii) and (iii). It can be deduced 
not form the partition 2—0; 


. һе 
е are Ë level-subsets with t 


0 
from (i) that these subsets 
(Ei) or the partition E = E. 


From this pattern the following tree-representations may be obtained: 
9/3, the set E on the final level; 
^ o the initial Subsets on leve] 0; 
(E, Mesas. s {Em}, and 
о, the k level-subsets, 
From property (i) follows Ү 


absorbed by Е. Since the k subs, joint, then the subsets are also disjo 
on each individual level; and the initia] Subsets are also disjoint. Hence (i 
follows. "Тһе proposition V (iii) follows from the fact that inclusions are 00 

permissible between the initial subsets and the А subsets and Z. 


" аге 
(i); all the other subsets on the last level 8 
ets are disj 
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2 > s 
N — > a non-trivial tres-representation in the final pattern. There are 
s sh the properties (i), (ii) and (iii) of Theorem 2. Because the tree- 
dise к non-trivial there is at least one level-subset S on a level s 
К 5 ta 22 k level-subsets are the subsets S which are disjoint 
Dye e subsets (Бұ) which are needed to form by union the set E. 
ows that 1 < k< m and the properties (ii) and (iii) are fulfilled. 


Corollary 
PES. Theorems 1 and 2 it follows that a final pattern contains at least one 
ivial tree, if and only if the conditions of Theorem 2 are fulfilled. 


7. COMPUTING LEVEL-SUBSETS 


Before the computation is described an example will be given. 


Assume the following input data: 


i 1909110000001 
я 1005171001187 
і та 580011000018 
а 1іфтіРіз 16% 
s + 0001 0 Q 00 0 0 
Мо- 6 1117061100855 
z Q Q 4 4 D 0 0 D Ç ü D Ü ( 
ЖЕТЕТІЗІІЗТЕЗН 
s iggpmiírtot ti? 
10 11601 0010р? 
The matrix Ма of distances between elements is: 

V"EwTITLILLL 

ааа жін. 

ЖЕТЕЛІ 

ate wv © ` 

ЕЕЕ 

та 

à ws? 

0 6 8 5 

0 8 11 

0 7 

0 
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The representation of pairs is: 


i ж (19) (5,9) 

2 (1,5) (2,9) | 
3 (2) Q4) (25) (610) 

4 (з) (9 G9 (62) 

5 (2,3) (3,8) (3,9) (4,9) (7,10) 


6 (1,10) (2,8) (3,5) (%5) (4,10) (5,10) (7,8) 


7 (9 (2 G) 49 G6 G 1) 
8 G1) (69) 09) (89) 

9 (18) Q0 (46 (лу (55) 

10 (2,6) (2,7) (6,8) 

11 


(6) (810) 


The representation of pairs is re 
rows are in one-to-one corresponde 
occupy the first position of the pairs 
on the corresponding row of С: 


56 
Placed by two arrays G, and б» wc 
nce with the levels. Тһе indices W^... 
1 огде 


Оп a given level are written in natura 


Pe : 1; the corresponding indices for the secon | 
position are written on each row of Gy. In the present example: 
G: 1 5 G,: 9 9 
f 2 I 5 9 
1226 2 4 510 
] f 3 6 3447 
: : To 3 3 8 9 910 
3345 10 8 5 51010 8 
- : | 8 10 9 9 9 
% 5 810 6 7 8 
2 2 6 67 8 
3 8 6 10 


ы = 
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йу кз ei à the level-subsets is effected by successively exploring 
jede | a cn y баға оҒа variable b=1,2,..., 11 and by applying 
Costus sU P ма TRIANG. А Ву this procedure the level-subsets аге 

pedes е h s starting with the representation (or graph) of pairs. 
Nxf marie а ue of b iw arrays T, (an Nx1 column vector) and T, (an 
Which are um к (where / is the maximum number of indices 
hid op Ha e op a line in G, or G». All the distinct numbers which can be 
nnmis te ся rows іп G, are written in natural order in Т,. All the 
an m I e first b rows of б, which are in correspondence with the 
неет а from Т, аге written іп natural order іп the corresponding 

> All the other elements іп T, and Т, are 0. 


In the example, if b=3, then: 


1 2590000 
2 4590000 
5 900000 0 
T,= 6 Т.- 10 0 0 0000 
0 0 0 
0 0 0 


el b, the square matrix, R, 
d column has at least one 
with b=3, the resulting 


is "es order to determine the level-subsets for lev 
Ca from T, which for each pair of row an 
А ent different from zero. In the present example, 
quare matrix is: 


2590 
4590 
Е- 9000 
10 0 0 0 


ei Then from Т., all the triangular arrays U, are formed (with the шың a 
май order) such that the elements which lie on the second diagonal are 


е ix R. 
d to one of the members of the second diagonal dus due dyes 
po each array there is associated a column vector U, puer 1 So for level b 
m T, which correspond to the elements of T; included in Әз» 


all 
arrays U, and U, of the following kind are found: 


i аш c 

% 13 М tk 
U, = is Um, >: tk 

ірі % 
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In the present example the arrays for 5 —3 аге: 


U, U, 

1 2.5 9 
2 5 9 

5 9 

6 10 

1 5 9 

5 9 

The following theorem will now be proved. 


THEOREM 3 


If a set (2,1,...,ір) obtained from a given U, and U, is not included in 
a level-subset on a superior level (including 0) then it is a level-subset on level 5. 


Proof 


digi, <b, since is, 4e( iy... i) Because the set is not included in a 
level-subset on a superior level there is at least one pair s’, t’ such that 4,4-0. 
All the k(k—1)/2 pairs which can be formed with Z,2,,..., iy are present in 
U, and Uy. The latter contain distinct pairs; and only pairs formed with 
elements from ij, is, . . ., ix, і.е. exactly 1+2+3+...(В— 1)=A(k—1)/2. 

In the example, the sets (1, 2, 5, 9), (6, 10), (1, 5, 9) are obtained. The set 
(1, 5, 9) is not a level 3 subset, because it will be found to appear on level 2. 
(In order that only those subsets which are not included on a superior level are 


written on any level, a computer procedure INCLUD is used). 
In the example this leads to the following subsets: 

(1,9) (5,9) 

(1,5,9) (2,9) 

(1,2,5,9) (6,10) 

(1,3) (3,6) (6,7) | 


(1,2,3,4,9) (6,7,10) 
(1,2,3,4,5,9) (1,10) (4,5,10) (7,8) 
(1,4,5,9,10) (1,5,6,7,10) (3,5,7) 
(1,3,4,5,9,10) (1,5,6,7,2,10) (2,8,9) (2,3,8,9) 
(3,5,7,9,10) (3,7,8,9) 
9 (1234589) (1,3,4,5,7,5,0) (1,4,5,6,7,9,10) (1,2,9,10) 
(1,2,3,0,10) (1,2,3,4,9,10) (12345.0410) (3,4 5,7,0,10) 
10 (1,2,3,4,5,7,8,9) (1,4,5,6,7,8,9) (1,2,4,5,6,7,8,9) І 
(1,2,3,4,5,7,9,10) 
11 (1,2,3,4,5,6,7,8,9,10) 
By slightly modifying the algorithm, it is possible to exclude thosc 
level-subsets which аге contained in level-subsets which are situated 
together on any lower level. The result is the final pattern from which 


со мї Ov Cn + оого — 
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different s 
ystems of cl 
and (с) of Secti usters can be selected b А 
ection 1. у absorpt а 
can, for inst From the final pattern of the ex ptions according to (a), (P) 
ance, be selected. e example, the trees in Figure 5 


15923486710 61071529834 
1 
2 
3 
4 
5 
6 
7 
8 
9 1,2.3,4,5,8,9° 
10 
11 

1,2,3,4,5,6,7,8,9, 10 


1,2,3,0,5,6,7,8,910 


Ficurr 5. Trees іп а final pattern. 


Worle 4 $. COMPUTING THE CLUSTERS 
hose subse : қ FE : 

e subsets are written on every level which are disjoint with the 

isjoint with or include 


evel-su 
tbsets alre В 
already written оп the same level, 2 
obtained in the 


the 1 
evel-subsets 
ubsets on preceding levels, then a tre 
evel-subsets it is possible to find all the 


trees 
S that 1 
at may be associ 4 ; eal 
y be associated with any given classification problem. 
d biology it seems to b 


шы тс жаб) фе te in psychology an 
е final x Ble a final pattern from which many different trees can be selected. 
problem then concern f level-subsets producing the 
the number of classification 


e more 


cluste 
ters, and i Ф Í 
Possibilities p pei the number of trees; that is, 
. То determine this complexity of the final pattern of level-subsets 
г described elsewhere (Constantinescu, 


Ve alte 
rnati B 
; 6); а algorithm have been briefly 
€gnier (обе b call are also available. i 
5) is called the ‘ one-level cl 


lere 
may b. etho 
cal Н ing m 
e called a ' multi-levels clusterir £ 


If the clustering hod described by 


ustering method ’, the method presented 
, 
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COMPARISON OF FACTORS BY AHMAVAARA’S METHOD 


Ву Max HAMILTON 
University of Leeds 


Th ili i i 
e comparability of factors obtained from different sets of data has always been a 


Problem of inte: ў 
m of interest but great difficulty to psychologists. A number of methods have been 


m ewm but of them Ahmavaara's (1954) is th 
y fairly safely be used when the variables in the popul 


this it hz omit 
it has been little used and rarely referred to. In the book by Harman (1960) there is a 
of comparing different factors is 


е simplest, neatest and most elegant. This 
ations have equal variance. Despite 


brief = 
deste to the work of Ahmavaara. The method 
ed by Fruchter and Jennings (1962), but unfortunatel 


descripti 

Scriptio D i ; 
ption. For this reason an account of the method is thought 
"Phe basic theorems and the proofs of the formulae are 


loes give a useful introduction to the method. 
f 152 men and 120 women suffering from 

scale (Hamilton, 1960). From the 
f intercorrelation were calculated and 

These factors were then 
to the criterion of Kaiser 
turations have 


у there is an error in the 
to be worth publishing 


ee to a practical example. 
Th but the following simplified account c 
2 primary. pus inthis example is based on ratings о 
ratings for a illness, using а 17-item rating 
then furto he two sexes separately, two Matrices о 
rotated by ена by the method of principal components. 
(1960) six fa he Varimax method (Kaiser, 1958). Conforming 
een “чеш 4 were extracted. (The correlation matrices and factor sa 
The six elsewhere (Hamilton, 1967)). А а | 
е six (orthogonal) factors constitute the basis of a six-dimensional subspace of the 


total spac < 
Оен are of 17 dimensions. Since the variables used for the rating of the men and the 
are the same, the two sets of factors define two subsets in the same total space. It 
t of axes (factors) to occupy roughly the same 


may 
pap He de possible to rotate one set о! | r “he 
occupy the he second set by means of а suitable transformation matrix (provi ғ a ji 
actors as von: subspace). Such a transformation matrix will show. the secon TE 
sets of е inear combination (weighted sum) of the first set. It is obvious ca €— 
the other a wee initially identical, the transformation matrix required to m = a 
the same would be the identity or unit matrix (provided that the two 8 o em columns 
а: If the order is not the same, the unit matrix will hav dece ues cm 
matrix will Қы the two sets of factors are not quite identical, then tl deine ae 
resembles, е not quite a unit matrix, Thus the way m which the е 8 д Ееее" 

etween К i differs from, a unit matrix will correspond to ier? с-а Row 
Show the E EVE: ы of factors, Even more important, 'orma 
š Ur ber dapi of the second set in terms of the first жы, ere деде 
Ing the va | that the two sets of factors are represented by Xand Y, йана uad 
then th ariables and the columns the factors (there being therefore е 

€ transformation matrix proposed by Ahmavaara is 
а ds vs that when X and 
formula for L shows that w 
for variables to be 


e it is customary tor ° 
normalized, i.e. the кй 
This is represented by Lx: 


crf ix Y e rows that Ot ^ 
‘tha ee of L represent the common factors of the matrix y кй a M" e 
actors £ cach row of Ly represents the projecti g) of pto горе Z дыла 
„Ж. Y. Ahmavaara’s own words cannot be bettered ( рд: n de factors 
rs of the matrix 7» 


its c 
olumns represent the common facto! 


. NOTES AND CORRESPONDENCE 


the trans 


Wher 
ех”; 
is the transpose of X. Examination of the 


Бағ: 
re identic; | е 
entically equal, then L is a unit matrix. Sinc 
of L should ber 


4 терге 
Sente -— ñ 
ne eee, by vectors of unit length, the rows 
S of the elements of each row should sum to unity. 
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being the same in L, as in Y. On the other hand, the rows of L, represent the common 
factors of X, the order of the factors being again the same in L, as іп X. The elements of 
L, then indicate the cosines of the angles between the factors of the different studies or, 
more accurately, between the vectors representing these factors. Consequently, if some 
element of L, is exactly equal to unity, the respective factors of the two studies are 
identical". The correlations between the factors in the two sets can be evaluated by 
calculating L,L;'. Since L is the matrix for rotating X into Y, then the product XL 
should correspond closely to Y. This serves as a check on effectiveness of the comparison. 
Finally, it must be remembered that since L is not an orthogonal matrix, the rotated factors 
are not orthogonal. Should orthogonality be desired, a suitable method is described in 
Lawley and Maxwell (1964). 

In the case of oblique factors the procedure is a little more complicated. To ensure that 
the vector is of unit length, the normalization must be carried out before the oblique 
rotation, for only with orthogonal axes does the sum of squares come to unity (extension of 
Pythagoras’ theorem). ‘Thus if F represents the matrix of factor saturations, the factors 
being orthogonal and obtained by any method, \ represents the non-orthogonal rotation 
matrix and Y is the matrix of factor saturations in the rotated oblique factors, then 


for factors X: F,A,— V; 
апа for factors У: FyAy=Vy. 


The initial transformation matrix L= (V Г): Ру. When the rows of L are normalized, 
then L, is the transformation matrix which compares I’, with Fy. ‘Then (quoting again) 
“when L, is postmultiplied by ^, the results Г.А, is the matrix of comparison between the 
primary factors of X and the reference factors of Y. "Го be able to compare the primary 
factors with each other, each column has to be multiplied by a constant, obtained by 
calculating (A,’A,)~!, then taking the diagonal elements as a diagonal matrix D?, and 
postmultiplying the transformation matrix by D-!, thus: А,” 

The procedure can be applied for transforming Y into X. When the variables are not 


the same in the two sets of factors, the method can be applicd to those variables that are 
common, ignoring the others. 


Comparison of Factors in Depression 


As a guide, the method was first applied to the principal components. "Тһе results 
(Table 1) show that the first factor for women (WF) corresponds almost exactly to the 
first factor for men (MF). There is very good agreement between WF, and МЕ, There 
is fair correspondence between WF, and МЕ; (but the scores run in the reverse direction) 
and for WF; and МЕ, WF, has a fair correspondence with МЕ, but an even greater one 
with MF,. This leaves WF; and МЕ; as the remaining unmatched pairs, but in fact УУР» 
corresponds more closely to MF, and MF, than to anything else. "Тһе general conclusion 
to be drawn from examination of thesc results is that the first four factors obtained from 
the matrix show reasonable resemblance between the two sexes. The lack of adequate 
correspondence between the last two might be put down to instability, Ап alternative 
suggestion is that the matrix contains only four factors worth considering. 


TABLE 1. COMPARISON oF FACTORS BY Anwavaana's METHOD: Principal. COMPONENTS 
From 152 men 


1 2 3 4 5 6 
1 1:00 - 0:03 0-08 0-04 0-00 — 0-02 
2 0-04 0-93 - 0:00 0-28 — 0-05 — 0-24 
4 0-11 0-46 — 0:70 =0:42 0:33 0-41 
Етот 120 3 0-01 — 0:22 0-46 0-68 0-48 0:22 
women 6 0:65 0-08 — 0:41 — 0:34 — 0:51 — 0:16 
5 0:06 0-39 0-66 0-59 -024 -0:03 
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TABLE 2 PARIS ОЕ FACTO! BY AHMAVAARA’ нор: VARIMAX 
Е С OT: 
2, OMPARISON 
RS h s МЕТ) 
MAX ROTATION 


1 
1 0:97 a қ : 3 
2 -021 093 о Е 0и 1 
Fr 6 001 024 Ies ОШ 0-13 7020 
om 120 5 0-01 ae Ei -0:30 -05 Da 
VOTED 4 — 0:26 —0:25 0-92 0-53 —0-36 
: m — 0-01 —0-29 —0-05 EA 0:16 
49 035 047 — -044 n t 


Com $ ч 

: ? parison о ` ү а 

ОЯ раты А factors (Table 2) confirms the belief that the method of 

women WV у the presence of invariant factor st " SE eat 

Non DIE r кеч а structure. "Тһе first factor f th 

correspond 1 р ery closely with that of th 7. ое 

ence betwee n JA а e men MV,. eee 
тке бос бейе nines beh a and MVg, and also between WV, and beu T sci ah 
etw IV, 2 no 5 a ere is fai 
ponent fror ween запа MV5, but the former contai pus 
к m MV,. à 5 rmer contains also a lai i 

MV; and simtlag si Li Ys corresponds most closely to MV, but has a large ioni ae pea 

MV, but си га from MV, and МУ. Finally, WV; corresponds sii 4- die 

between all six fn arge components from МУ, and MV. Since fair corres nade = 
six factors has been obtained, the conclusion is that the possibility of = 


stabilit 
y of the tw: inci 
the two smallest principal components is a reasonable one. 


Гав 3 
LE с 
. OMPARISON г FACTORS МАУ. ү 
м OF FACTORS BY AHMA AARA's METHOD: ARIMAX ROTATION 


From 70 women 


1 2 3 4 5 6 

: nom -049 —0:37 —0418 004 —008 

" Ae -0:86 0:10 -0416 —047 0-18 
тот 50 6 / - 0:06 —0-70 -0412 0:70 —0:01 
Women 4 0-09 0:25 -037 —0:81 -037 0-03 
5 0-31 — 0:58 0:39 0:60 0:03 0:22 

0:27 0:57 -021 0-31 -025 0:63 


stical significance for Ahmavaara's method. In 
espondence between the rotated factors 
lecided to compare the results from two 

d in Table 3 and, 


U 
an аны шыл, there is no test of stati 
obtained ds to determine how good is the corr 
Successive oe the data of the two sexes it was ¢ 
mentioning apies of women, the first 50 and the second 70. This is foun! 
Teasonably he factors from the sample of Table 3 the 50 women first each time, there is 
Clinton he correspondence between the factors 1 in the two groups. Factor 3 

and 5, ын well with Factor 2, and Factor 5 is almost exactly half-way between 

Considerap uctor 0 has reasonable corrcspondence with 4 and Factor 2 with 6, but has 
ras two екен from Factor 2 of the group of 70. This leaves Factors 4 and 5 in 
ro fore oor s unmatched, Factor 4 corresponds most closely with 4 and 2, There 18 
oe has гч рган | іп the identification of the two sets of factors. 
па It wo oe оп the resemblances found int 
ihe AER the a uld seem, on the evidence, that the Varimax meth 

difficulty p pearance of invariant factors. Perhaps the restriction 0 
d x to say vem when this is removed, other difficulties ippon МА 5 tomatology 

pressive dew, c possibility that there is an essential difference a КҮТ! a 

s in women as compared with men cannot be ЕХО ЧЧ“) 
geat: 


Sona Dle t s 
9 say that any difference will net 1 


arburton, 5) 
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BOOK REVIEWS 


ашуу Sampling. Ву LesLie Kisi. London and New York: Wiley. 1965. Рр.хуї+ 
w 835% 


For the non-specialist wishing to acquire the fundamentals of the theory and practice of 
sampling human populations, this book is very nearly the ideal starting-point. Just under 
half of it is concerned with the fundamental methods of survey sampling, using mathe- 
matics no more complicated than sixth-form algebra. "Гһе second half is a detailed exposi- 
tion of more specialized techniques, including a long account of the practice of area sampling 
in the U.S.A. 

In its contents, and in its level of exposition it must be compared with the two leading 
works on sample survey theory, Yates' Sampling Methods for Censuses and Surveys and 
Cochran's Sampling Techniques. he former of these is a detailed manual for survey 
practitioners, with little or no mathematical derivation; the latter is a thorough introduction 
to the statistical theory of surveys. Both Yates and Cochran are specialist works, in 
different senses; the one is for the specialist in practice and the other for the beginning 
specialist in theory. Professor Kish’s new book is a nice mixture of the two, with enough 
theory for understanding and enough detail for practice. It has clearly been taught and 
re-taught to the great benefit of the text, which is dense with lively and occasionally provo- 
cative remarks. It is full of good examples and problems, and its pervading air of realism 
stems from the practical experience of the author. Like Cochran, Kish is concerned 
primarily to teach; like Yates, he provides a reference book for techniques. 

This is not a book for survey theorists, as is evidenced by a sentence on page 229 which 
dismisses a great part of their last decade’s research as “ too complicated, I feel, for our 
purposes”, "Тһе emphasis throughout is on easily computable and reasonably efficient 
procedures. There are very few errors in the text (all of those which I found are corrected 
in an advertisement in the June 1966 number of the Journal of the American Statistical 


Association, three pages after page 575.) 1 should be surprised if this book did not have a 
very long life. ALAN STUART. 


London: Methuen, 1966. Рр. 181. 255. 


Contrary Imaginations is written іп а lucid style, = алй pip ee 
contrast to much present-day writing in psychology. In this and ot е зеза i 
is an admirable exponent of his own principles. ‘Thesevare set биР ае E [e inam 
he sticks to them throughout the book—no jargon, no complex реле үң E r untestable 
Subject matter, no ‘ blind ' theories based solely on statistics, nO PSR rdi disci lines of 
theories and, on the positive side, a desire to reconcile to A ы 2 ue they 
mental testing and psychoanalysis. The test of these princip es, eens ont ane 
lead him, and they lead him quite а long way. But itis, perhaps, СЕ behaviourism which 
tics... "— surely as dogmatic in its way 45 l ri tt ener him now and again in а 
Dr. Hudson criticizes on the grounds of its rigidity—tha eschew multivariate 


7 his own purposes, 
— тиынды e ey Rm in results cannot be referred back to 


methods of analysis (presumably because he he logic underlying simpler 
human subject matter) but he fails, I think, to understand the 108 ы 
Statistical methods, e.g., the effect of selection on correlation. ends dt^ 
Тһе main part of the book consists of a description of the er e 
boys in the upper forms of public grammar schools in tests e E itiss: quesid 
argely borrowed from Getzel and Jackson) and in a persone я T d, probably b 
r. Hudson selected boys from the upper levels of general ability апо, La out—namely, à 
this, two other well-recognized dimensions of the group factor variety 5 


Contrary Imaginations. By Liam HUDSON. 
simple, 


reasonably clever 
ace and creativity 
onnaire ". 
ecause of 
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bias towards either verbal or‘non-verbal success in the intelligence test, and a bias towards 
divergent or convergent thinking in the tests of creativity, А 
Statistical tables at the end of the book give indications of these trends, and the tests 
(including creativity items) which demonstrate them; and for sub-groups of boys there = 
(i) intelligence scores of future scholars, exhibitioners and commoners at Oxford 188 
Cambridge and of boys accepted at other and at no Universities, and (ii) sixth-form subjects 
chosen by convergers and divergers. i . 
"The book ends with a tentative explanation of the differences between the intelligent 
convergers and divergers in terms of parents' (I nearly wrote “ parental ’, reading this book 
has cured me of that) personalities and their influence on offspring—the * double-bind * 
Dr. Hudson's main contributions would seem to lie first in approaching his subject- 
matter from an essentially individual angle. This emphasizes, by descriptions of selected 
cases, the boys who are not neatly classifiable by the tests, and whose choice of university 


subject and whose class of degree may be unpredictable. Second, and following from this, 
there is his openness of mind in allowing, in a matter of fact way, a place for home atmosphere, 
emotional disturbance, and psychoanalytic mechanisms, in his individual studies of cognitive 
processes. 


However, unless he is Prepared to publish the kind of statistical tables which at least 
allow the reader to count heads, it is not clear how he can assess the scope and importance 


of the problems that he hints at in his selected case studies. As things are, it is impossible 
to separate his evidence from his speculations. 


Nevertheless, the book is interesting, and the approach an original one. Psychologists 
should remember though, that Dr, Hudso 


0 i Dr n is not the first to combine a genuine interest in 
children with the study of cognitive ability, CHARLOTTE BANKS. 


ychophysics. By D. M. GREEN Я ETS 
London and New York: Wiley. 1966. Pp. 455. 104s. кча. Sw 
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of TSD in Psychology. Part I is largely statistical, describing statistical decision 
theory às a model of choice behaviour in the detection task. "Topics systematically covered 
include alternative assumptions and their consequences, possible experimental paradigms 
and the form of the data expected by the theory, methods of analysing data, and comparisons 
among various models of behaviour in a detection experiment. This section, together with 
an admirably constructed appendix on experimental techniques, is sufficient to give the 
reader a working knowledge of the theory and its application to psychophysical problems. 
Although the theoretical exposition is well done, it is limited in scope, covering only the 
statistical decision procedures for the simple detection task and the m-alternative forced- 
choice task. "Тһе authors have not included any formal treatment of sequential decision 
procedures, an arca of interest in statistical decision theory and probably of importance to 
Psychology. "Тһе relationship of TSD to Thurstonian scaling is mentioned but not 
elaborated, and no effort is made to connect T'SD with similar approaches to the same basic 
problem such as information theory and probit analysis. 

As a first approximation to a description of the scope of this book, one can fairly state 
that few excursions are made beyond the topics contained in the collection of papers edited 
by Swets (Signal Detection and Recognition by Human Observers, New York: Wiley, 1964). 
As a result, every piece of formal or mathematical exposition on TSD can be (and is) 
accompanied by data, giving the work an apologetic air. On the other hand, such con- 
Servatism in “ sticking to the data " is refreshing in view of the current rambling literature 
of mathematical models with more parameters than data. 

Тһе second part, dealing with sensory processes in detection, tends to be more mathe- 
matical than any other part of the book. Contrasted with Part I, which reads rather like a 
book on psychological statistics or measurement theory, Part II draws heavily on results 
obtained in electrical engineering and will probably discourage most readers without a 
mathematical or engineering background. Ап appendix is included covering enough 
waveform analysis to make the section understandable to unsophisticated but determined 
readers. Part II is a graphic reminder that TSD is а stimulus-oriented theory requiring a 
description of the signal and its noise background and leading to a specification of the 
performance by an ideal observer. This approach means that there are very few free or 
hypothetical parameters, since most parameters can be measured at theinput to the иеде 
The freedom allowed by the theory is in the form of various levels of uncertainty about а 
signal and internal noise which can be postulated to degrade optimal performance in such a 


i i гі б i coustics 
Way that it describes human performance. "The price of such a theory in psychoa E 
Ideal performance is more difficult to 


is apparently some complicated mathematics. ne nore өза 
specify for the visual modality, апа there has been hardly апу analysis of this kind for о 


i r 
modalities. The approach using the ' ideal observer T concept аля eene] ide 
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Logical Foundations of Psychological Measurement. Ву Srrancr Ross. Copen- 
hagen: Munksgaard. 1964. Рр.152. 36kr. 


A number of significant contributions to psychophysics and measurement theory have 
come from Scandinavia in the last decade. The majority are from Sweden and are con- 
sistent with, or at least sympathetic to, S. S. Stevens; this monograph is from Copenhagen 
and follows the more traditional position of Campbell. š 

The author moves in four steps towards what he considers to be a demonstration of the 
inadequacy of Stevens’ position; a review of semantic systems and propositional calcu 
leading to formal definitions of representation and isomorphism comes first and follows 
Carnap or Tarski on many points. This review should be useful in itself as a background 
reading for mathematical psychology courses if students have 
symbolic logic and have used a notation similar to that in Cop 

From his axioms Ross then outlines elements of 


Ordinal and additive measures are separately treated in that order, each being introduced 
through appropriate calculi and interpretations of these. As might be expected from a 
follower of Campbell, additivity plays a central part in Ross’s measure theory. Р 
In his calculi of addition, Ross introduces the ‘ axiom of continuity ' and notes that it 
creates difficulties; he uses weight as a topic for a measurement theory in physics and 
immediately he is obliged to introduce the possibility in statement 5,4 (page 53) of breaking 
up given objects, an operation which may not have an analogue in psychological measure- 
ment. This would not matter, except that in chapter three Ross tries to parallel his treat- 


ment of physical measurement in his psychological theory, and his treatment centres on 
Campbell's distinction between extensiv 


npb "€ or fundamentally measurable dimensions and 
derivative measures, 


| _His determination to subsume physical and 
similar formal structures is nicely put (page 68) 


that all knowledge within any branch of science may be stated by axiomatized theories of 
the structure Previously described in this monograph (my italics) . . , consisting of а semantic 
System and a calculus (and a set of constancy rules), Asa calculus is completely inde- 


pendent of any interpretation it follows that it will not be possible to discriminate between 
the different branches of science on th mployed within these branches.” 


From this position Ross is led, w ologi 
кошш г ‹ gical measurement theory as а 
р ss е с r ) hic to either the calculus of weak order or 
the scale types ean к prr C ps ra LN m p? 
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п axioms of the general theory, Ross gives (page 110) 
NA 2-N (аў 
AAS О) = N Œ) +N (y) =2P(xAy) 
(в) = М (x) + N*(y) =2P(xA*y) 
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where А and 4% are operations of combination N and N* are number assignments according 
to the two methods involved, and P is an equivalence operation corresponding to ‘ = ' in 
the number system. Не then takes two elements So and 59% such that syPso*, and applying 
ХААЗ gets 0,P(so455*), from which he then gets 

(3.10.6) — N (s) -N (5%) = N (04) 

and 

(3.10.7) Му) + N*(se*) = N*(02). 

However, as Eisler in 1965 pointed out, (3.10.7) implies 0,P(s,4*s)*) and we cannot 
Properly state that 04P(spAso*)0;PD(spA*s,*). On the contrary, it is empirically very 
unlikely that two measurements systems differing іп 4 and 4* will yield the identity 
Х(0,)- N*(0,). 

So the central theorem on which Ross’s thesis of an isomorphism between physical and 
psychological measurement rests is invalid. Ross’s reductionist attempt at theory leads 
him to state, in effect, that if a psychological scale of x-ishness exists it would necessarily be 
the same as the physical scale of x-ishness. Obviously this position is in head-on conflict 
whit Stevens’ rationale for constructing psychophysical power functions, and Ross is 
concerned to offer an account of what Stevens is actually doing when he uses magnitude 
estimation or production. Ross does this by asserting that Stevens is developing a sub- 
jective and not an objective psychology: “ the criticism of the conception (presumably) 
underlying Stevens’ scaling procedures may be directed at the most extraordinary fact that 
according to the conception, it is left to the subject to perform those operations performed 
exclusively by the experimenter within an objective psychology ". The subject’s state- 
ments (about stimulus magnitudes) no longer belong to the object language, but are on a 
par with the statements of another psychologist and thus belong to the metalanguage. 

Ross ends up, therefore, in a logical position which he could have arrived at without 
his prior excursion into measurement theory, a position in which his definitions of the 
logical status of subjective experience or the reports of such experiences in psychology 
necessarily determine what shall and what shall not be measurement. This book is, in a 
way, five "years too late; whilst Ross has been stating what is measurement, Suppes and 
Zinnes have made contributions which strongly suggest that the insistence on additivity in 
Campbell's view of measurement is not well put, and independently Goude has, аага 
found an empirical operation corresponding to numerical addition that can ре ae a 
the context of Stevens' psychophysical procedures. It is sad to reflect that if x i: a ү 
had not died early, before he had finished the demolition of Campbell, сары E E 
little known paper of his on Chance, then we might have been ен M. балон, 


confusion. 
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Pattern Recognition: Theory, Experiment, Compute 4 
Models Кто, Perception and Discovery. Edited by LEONARD Uur. London 
Pp. ix +393. 68s. (cloth), 45s. (paper). 


and New York: Wiley. 1966. m), же a 

This book on the recognition process in man and moine a oe = i 
published papers. But unlike most such collections, where ле pu id 
Similar methods to investigate closely related questions, it сең ете дее есектен 
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5 і i oncerned with the empirical 
mre ui а fur papal vie "Берік Bulletin 
Байташ fos ns ірі ^ Arnoult, and Hake. ‘The paper by Hake on 
Мын НАН ы ры шо сы is especially valuable as it is a very thoughtful 
form месте has been published previously only as а technical report. — 
us > of the other papers are important contributions to one of the ws. es oat 
паа бла ика мач дана маеннан Т ЧИН, 
i siali vever, as the area is so wide they e y y d жағала 
P ribs о ння Be s: could be covered, so the expert will probably not ық шей 
use of the book. Unfortunately, to the uninitiated they may well pue e гез -— Eae 
boring because they are not edited; and not only are some of them highly tec WC. x iind 
are also relatively unconnected, which makes interest much harder to sustain. phi ds qe 
have been remedied by greatly lengthening the introductory notes to each section, three 
hi ly just over one page long. . 
pis о. scabies das it mdi have been better if a single author € Бп 
persuaded to write a book covering the present topic; but this volume does serve a valua e 
function. Tt delimits a relatively unstructured interdisciplinary ficld and focuses pic! 
attention on a very important and almost totally unanswered problem. it is poky 
consequence of it being an edited selection of papers that one area which might well provide 


А рола atics of 
very useful lines of attack was not mentioned. This is the study of the mathematics 0 
pattern recognition; e.g. the work of Nilsson. 


Yet to have added mathematical papers to 
the others would possibly have been the 1 


ast straw for many readers. Т. SHALLICE. 


Contributions to Mathematical Psychology. Edited by NORMAN FREDERIK 
HAROLD GULLIKSEN. New York: Holt, Rinehart & Winston. 1964. Pp. 
545. 


This is a symposium of papers in honour of L. L. Thurstone presented when Thurstone 
Hall was dedicated to bim by the Educational Testing Service. "Тһе quality of the papers 
the academic distinction of the authors (who were his students or strongly influenced by 
him) and the different universities they represent form a magnificent tribute to 'T'hurstone 
himself and the spread of his influence. 


Five of the papers are mainly mathematical in their conte 
last, are not. The first, by Dorothy Ав 


n T ~ . . түз ^s 
с a kins, North Carolina, is an account of ‘Thurstone’s 
professional career, with a few sidelights 

haustive bibliography 
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chance conversation with colleagues after his appointment as Professor of Psychology at 
Chicago and not long before he published The Vectors of the Mind in 1935. 

An interesting feature of most of the other papers in this symposium is that they are 
more in the nature of progress reports on the work of the authors than final reports of 
limited research projects. So they provide excellent, brief and authoritative outlines of the 
authors’ approaches to their various subjects. 

Coombs, Michigan, summarizes his theory of data before going on to a non-metric 
multidimensional analysis of a conditional proximity matrix. The example he gives is a 
non-symmetrical square matrix in which each row and corresponding column refers to an 
American psychological journal, and the characteristic entry in cell i, j is the frequency of 
references to articles in journal j appearing in articles in journal i. He found the data fitted 
the requirements for a metric space well, with one dimension predominating, which runs 
from the Journal of Abnormal and Social Psychology and the Journal of Consulting Psychology 
at one end to the Journal of Experimental Psychology and the Journal of Comparative and 
Physiological Psychology at the other. 

Gulliksen, Princeton, gives a discussion of some of the 
analysing statements of personal preferences and follows it with an analysis of a massive 
collection of data from groups of students in Belgium, France, Germany, Italy and Texas, 
U.S.A., who were given four questionnaires concerned respectively with the evaluation of 
occupations, reasons for work, nationalities and goals in life. The obtained values were 
scaled and their means calculated for each group. Some comparisons between the groups 
are recorded graphically. ‘The data for Belgium and Texas were used for two factor 
analyses of the intercorrelations between the items in all four questionnaires combined. 
A rather striking finding was that there is little tendency for the rotated factors to have 
loadings on more than one of the questionnaires. Thus, for example, evaluations of the 
goals of serving God and gaining immortality, which characterize factor 6 in the тез 
data are reported to be unrelated to evaluations ©: 
clergyman which characterise factor 3. қ 

Tucker, Illinois, tackles the problem of extending factor analysis to three-way shod of 
(which he calls, rather confusingly, three-dimensional matrices) and provides a pete) 9 
reducing a table of the measurements of a number of variables ina number of cases i 
number of occasions to a smaller three-way table, or core matrix, which may peine 1 
capable of being interpreted іп the usual factor-analytic terminology. But as his examp 

f his results is hard to judge. 
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'The problem Шел by Horst, Washington, 18 one met in app el же 
to the items in a test or questionnaire scored dichotomously. It was ue le ecale if there 
examined the implications of the concept that items must habng EU = = or from least 
is any way of arraying them in order of bias, e.g. from easiest P nihil answer to an 
favourable to most, such that whoever gives a positive (correct ог cin 
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bias though “ we know that it is an essential part of the warp and woof of all fests aS 
of dichotomously scored items”. When the items may refer to more than one ЫЯ 2 s 
the status of some of the items in relation to some of the scales is open to doubt a urina 
say in the typical conditions where a multifactorial item-analysis is needed, the ane зд 
becomes more complicated. Effects of differences іп bias need to be disentangle = 
effects of differences in scales. Horst proposes a procedure to partial them out and obtain 
a residual matrix for factoring. 
Abelson, Yale, discusses changes in the distribution of attitudes that may occur when 

a controversial issue is discussed between members of a social group. He makes use of 
contact matrices, which are square matrices, say 4, with a row and corresponding column 
for each member of the group, and are symmetrical if the entry in cell і, j and again in cell 
j,i is the frequency of contact between members i and j. These off-diagonal entries, for 
ij, may be positive or zero but cannot be negative; d, 4 
i—j, may be taken to be zero if it is considered reasonable to assume that contacts of / with 
himself do not produce any change in his attitude. The initial attitudes of i and j towards 
the issue are assumed to be measurable as x, and ху, and the change in the attitude of í as a 
result of contact with j over time t is expressed by the differential equation 


the entries in the leading diagonal, for 


where aj; is the entry in cell i, j. 

This leads Abelson to consider A*, a derivative of A, with the same off-diagonal 
entries but with negative entries in the leading diagonal which reduce the row totals to zero. 
He analyses it into components with real bipolar vectors and negative roots, The vectors 
define the divisions in the group: contact is relatively close between members located at the 
same pole and slight between members at opposite poles; the first component (which has 
the smallest negative Toot) gives the major cleavage, etc. The initial attitudes of the 
members towards an issue will be the slower to change, the more closely the vector of theit 
measurements coincides with the major dimension of cleavage. The model implies that 
all the values of x con 


verge asymptotically, on a mean i.e that the me imately 
а \ ym y .e. mbers all ultimately 
Hi ie Provided there is at least one who is in contact, directly or indirectly, with 
others, <tendi i luci à 
M xtending the model by introducing many further 
The last paper, a literary one by Guilford 
Creative processes. He relates them to his 


Finally, Abelson Proposes e: 
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reference for a long time. The papers on subjects which are unfamiliar to me are the ones 
I find most interesting; the ones on subjects I have studied closely are relatively dis- 
appointing. 

Abelson's differential contact frequency matrix fascinates me with its very peculiar 
properties and I would have welcomed more information about his methods for analysing it. 
Coombs's conditional proximity matrix interests me only slightly less. The main attraction 
of both their papers is the prospect they offer of widening the scope of scientific observation, 
They provide methodological scaffolding from which new vistas for research can be seen. 
It would not matter much to me if the scaffolding had some day to be pulled down to make 
place for more solid structures built on firmer foundations. The possibilities they reveal 
аге their attraction. 

On the other hand, when considering the papers by Gulliksen, Horst and Tucker, 
I find myself more critical of the structures, not being so preoccupied with the view. Their 
methods seem quite unnecessarily elaborate and not very appropriate for their data. 
Gulliksen's materials are so vast that his paper, long as it is, does not explain fully how he 
analysed his data or show what they were before he began. І can see no advantages what- 
ever in the equamax rotation he used for his factor-analyses: product matrices such as his 
must be singular, provided the effects of response set are removed, so the communality 
problem does not arise and his procedure starts with a false assumption; moreover, a 
rotation which tends towards a positive manifold (there are less than 10 negative loadings 


in the 300 or so listed in his Tables 11 to 14) is inappropriate for expressions of preference, 
since they are necessarily bi-polar. I һауе described elsewhere the methods which I prefer 
(Slater, 1958, 1960). 

Horst's problem appears to me 


tions in a matrix with a row for each in 
of variance to eliminate the variation between rows and columns, 1 t 
from the interactions and analysing that in terms of its principal components (it, too, is a 
singular matrix, and any analysis introducing reduced communalities would be inappro- 
priate). | š 

For three-way tensors a wide range of possible methods of analysis are worth шет 
ing. Several аге being used in the M.R.C. service for analysing repertory кї T. e 
promising one, however, and the one which is most relevant to the problem « nih 
Tucker, is still undergoing development. It depends on finding a convenient v 
method for solving the equation 

T(o, n, m) =À . vo + Un + Vm + E(o, n, m). 


of a series of o matrices with z rows and m columns 
на using the same соп- 


each, e.g. a set of repertory grids obtained from the same =o, er B 
structs and elements on o occasions during the course of a рү се Р 
referred to а principal component specified by three vectors, uw ME tensor of residuals; 
the rows and v, for the columns, and a latent root, A. Eisath p the sum of the squares 
and the specifications for the component must be chosen to ouo ec ario 
of the entries in 2. Dr. J. C. Gower, of the D apartmen? e aid ‘when I proposed the 
mental Bison, found a feasible algebraic non m i is due to appear in Biometrics. 
problem to him in the spring of 1966, and his piper on ғ analysis to three-way tensors; 

This is the orderly way of extending principal pan anil rs by following the same rule. 
and it can be carried further to four-way and higher order kei deer ay tensors, obtained 
The problem is far from being academic: prensa be defined as a tensor with 
from p patients undergoing a course of отир дее pee vectors, including v; for 
a fourth subscript, p, and referred to а component s 4 minimized gum of squares. 
the patients plus а four-way tensor of residuals with a Evie E t den al 2, 
Singularities present no difficulties, which is fortunate since g 


tensor the more singularities must occur. 


to be one that might be solved by arraying the observa- 


formant and a column for each item, using analysis 
forming a product matrix 


lere T is a three-way tensor built u 
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с izati incipal nents analysis. Biometrics. (In Press). 

lizations of principal compo у à 
Fides d ere 1580), Te general relationship between test factors and person factors; 
xn lication to preference matrices. Nature, 181, 1225-1226. Sait panta qh 
SLATER, TETEE (1960). The analysis of personal preferences. Br. Ў. Statist. Ps) 
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Multivariate Statistical Analysis for Biologists. 


By Hitary E. Sear. London: 
Methuen. 1964. Pp.xiv+207. 45s. 


This book is written for the biological research worker who has little — of 
š Наки UN 
least squares and multivariate models. It has the aim of affording him “ a sound working 


knowledge of . . . Regression Analysis of a Linear Model " and of explaining “ Wes ШЕ 
statistical methods іп relatively simple language which yet omits none of the technic 
details ”. 


The role of the computer in statistical analysis is at once acknowledged and it 
is rightly pointed out that there is danger in the read 


y availability of programs for multivariate 
analyses, which may be used indiscriminately with only a dim notion of the underlying 
m are two sections, Part A entitled “А Single Dependent Variate " and Part B 
entitled “ Several Dependent Variates ”, In the first Chapter the reader is at once pofi- 
fronted by a set of data for which a linear model is developed, and the remainder of the 
chapter takes him through the analysis of this data in a conversational tone which 
characterizes the entire volume, The topics of least square estimation, vector notation, 
vector addition and multiplication, solution of simultaneous linear equations, determinants, 
and estimation and distribution of residuals, constrasts, and regression coefficients are 
treated within the first 25 pages. 


The second chapter defines orthogonality of vectors and 
derives four linear models for a further set of data, It is shown that orthogonality of the 
instrumental variates simplifies computation, which 
users, but it is not m 


seems a slight virtue for computer 
entioned that this minimizes sampling variance of the regression 
coefficients. The third 


chapter deals with analysis of covariance, and the fourth, misnamed 
“ Multiple Regression ”, really deals with matrix algebra up to тпа 
Part B, on multivariate analysis, commences witha chapter on multivariate analysis of 
variance, The following chapter outlines princi nts and introduces eigenvalues 
and eigenvectors and the multivariate normal distribution. The next chapter, entitled 
Canonical Analysis ”, covers what is usually referred to as discriminant analysis and has 
nothing to do with canonical correlation, ity of covariance matrices is 
yer ее, | ter is a muddled treatment of factor analysis and will 
Beto of жаз "lar with this subject, situs 
йыры ыры ы.” of this boo vhat is omitted, especially 
` st of the difference of a mean vector 
t ut a known covariance matrix. „Тһе 
analysis, "Тһе multivariate Bel, Шу treate than, say, discriminant 
; or proportionality of опе 


ofa f references for the over-all subject: 
multivariate Statistics, For ce oe treatment of either least squares theory 5 
) А . 3 e, there is no refe Wawa lae «tbook: 
his Work contains Several blunders in my егепсе to Anderson's 1958 text 
Simultaneous Jine. numeri 


cal analysis, The solution of three 
is described for « the insight it gives US 
Solving such equations ", "Тһе solution for 

ristic polynomial is outlined and only in the 
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appendix is the reader referred to th i 
published in 1 e Jacobi method by way of a paper i 
ксі а ба Ec computer centres will e E серле pias 
bce amine baci n ega- Vilkinson method described by Ralston in 1965.) plage 
besides bee ы. ҚА egomarc the iterative solution for үне: е. еі 
that пе ые а i ong of these a Se сада ea FREES шуа 
sgt tates ansa cand im не reader with only “ a one semester course in basic statistics ” 
teer oras pne pp understanding of the models described. ‘The author's 
vici in cues UA € е d eceptive and there is a danger that the indiscriminate use of 
back. Toi aiia қон» will be encouraged rather than discouraged by reading the 
M dime d s E н of when models and tests are inappropriate due to, 
жеткені денеде pan eoe or sub-asymptotic sample sizes. On page 115 the 
waa еі » Lee ga ratio criterion is employed with an N of 24. 
байсал атқ «Жаз g^ bra fulfilled, it would have been a valuable aid for 
апа numerical analysis It an көтек; api € irme € 
р трас тт . s h er, have many sets of illustrative data—these are 
nly biological features. It may therefore be useful as supplementary material 


for a more 
К complete text. 
ext J. О. Ramsay. 


Introduction t isti 

о Statistical Ideas for Social Scientists. Ву С. Клілох У 
Chapman & Hall, 1966. Рр. 58. 6s. pa 9 
This is a short primer in which the author has attempted to provide a broad review of 


statisti i H e 

иа zt explicit aim was to provide an introduction to statistical thinking 

knowindge of Е р of detailed computations. The reader will probably acquire some 

бена basic statistical notions and will learn to compute basic descriptive statistics. 
testing procedures, including chi-square, Students f, product-moment correlation 


rag ge һү —— ее опе variable analysis of variance, are introduced. However, 

ы... eee s aim that this book alone should prepare an otherwise uninitiated 

лла Ns Esas ese procedures. "There are several references to more advanced texts. 
istica ables are provided. 

It is this reviewer's opinion that if 

competently, he will need access to more 


o learn statistical testing procedures 
texts provid ss to advanced texts than this one, and that many such 
Dowd өт | good a grounding in fundamental concepts as does this book. ‘The aim to 
likely А early stated introduction to statistical ideas is praiseworthy, but would be more 
жк, 6 шесе if greater space had been devoted to it. If supported by an appropriate 
iiem o» this book may be very valuable to the social science students for whom it was 
sho Ñ Ж lowever, it is quite possible that the main value of this book will be to students 
und ave already learnt some simple proceduri 

nderstanding what is accomplished by their use. 


a student is t 


cs but who have continuing difficulty in 
D. LEGGE. 


y. ByH.L. ZETTERBERG. 3rd Edition. New 


On Theory and Verification in Sociolog 
Jersey: Bedminster. 1965. Pp.177. 40» 

‚ This is an elementary and highly readable introduction to theory building in the social 
Sciences. For the third edition a new chapter on definitions has been added, and the text 
now touches lightly on a whole range of theoretical structures—from simple inventories of 
findings to complex axiomatic systems and computer simulations. In addition, the relation 


9f theories to facts is discussed. 
M of the book is concerned with simple on, 
qualitative statements arc subsumed under, or deduced from, other qualitative 
e to go beyond existing textbooks, or to criticize the 


st А 
atements, No attempt is mad 


* covering law ' types of exploration, in 
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i с Ç. telling points, 
i izi Y be. Butthe book makes some 
y kinds of theorizing they descri t m 4 
н “a d the kind of textbook which contains a mass of bus e ше 
particu ms E never integrated with one another. It is well supported 2 і dir А 
€ i he kind of ‘ propositional so 
i i rate image of the kir a! 
terials, and it conveys ап асси; j ) › Re 
eel que an increasingly influential group of American sociologists. B 


Memory Abilities: a Factor Analysis. By H. Рат. KEL 
Monograph Number П. 1964. Рр. хі+ 53. 53.00. 
This monograph contains a re 

cribes as “ relatively immediate int 

term memory’. The purpose of the study 


LEY. Psychometric Society: 


rs, eeds 
ple the factors postul 


ated and also differentiate between them. Realizing 
that this battery of tests is likely to me: 


asure abilities other than memory, he supplements it 
5. Air Force Airman Classification Battery. The 
final battery of 40 tests was administ 


s obtained are identified as verbal 
Their extraction may be thought 


» and once they ғау three factors which the author 


meaningful memory 


and span memory emerge. These are three of the 
four memory factors postulated: they appear to be relatively distinct fron: each other and 
to be independent of whether the material i 


to be convincing, 
All this sounds fairly satisfactory and in a sense it is, 


Yet on ге: 
value of factor an 


ading the monograph 
dies seem to well up- 
1964, the actual study 
Statistical Procedures used are 
he study is confirmatory rather than pilot in nature, and 
Postulated to emerge from it, a competent factor analyst 
zero and non-zero loadings for the data and proceed to 
This would eliminate the natural doubts one 
n hypothesis testing. However, 
say that this is an interesting monograph 
as well as by Psychologists interested in 
А. E. MAXWELL. 


t although the m 
s earlier, As а 
dated. Since tl 
tors have been 


This із essentially a * h. 
in the social Sciences, 
designer of a 
haustive ”, 


doit 
It Tepresents a геа 
questionnaire, « It is not meant to be 
Тһе strong point 
the individual item 
As a possible ©“ 
and sociology ” i 


its emph 
uestionn, 
dents a 


to make upaq 
9r research stu 


е scale. 
limitations, pa 


. . y 
graduates in social paychalego 
area of attitude measurement: 


құ 
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Тһе discussion in Chapter Five of the co L 
t : T 
m ый chapter of the relationship oe 5. be — m he account in 
erhaps these are two arcas where the writer has exerci s and behaviour is very inadequate. 
more esoteric controversies ". А full dis jon о: pm заш SIME 
à E cussion of thi T 
andixosearch warker in socal peyoholoey, dnd ese two issues is vital for the student 
in poem who is concerned with ihrer im cur ss е7 о (e 
| ‘The attitude scaling procedures outlined i i rom sisnow ledge ОЕ attitudes, 
Likert and Guttm: а p / MT buon gre the traditional, Ташев 
— Tr 
tec iniques are linked together in a ¿hai “ Miscell pane arki Repertory Grid 
includes soci : 5 а рїег оп Miscellaneous Techniques ” whi 
Pi on i epe roses and brand-images, and diaries. ductor kia 
jective techniques, ision of a section on “ experiments ” appended to the chapter on pro- 
last 2. Scale appears іп its original 1925 version and the 
The йе E hers errs Р selected readings outlining more recent research on the topic. 
би. ds дн әжені ас the end of each chapter often appear to make up for known 
by reference to od әде сы paragraph of text on response scts is supplemented 
ШАЛЫС Sa P: six se ected readings. This is particularly surprising in the light of the 
4 5 claim to have kept the number of references to a minimum. 
6 the ot mm methods are approached ina refreshing way through a full discussion 
IBS work w OE questions and the writing of attitude statements. The importance of 
еа п in testing the wording of individual questions and the optimal sequencing of 
ГА tenet ig amc is stressed throughout the book. The examples come from a 
"There e к though mostly from surveys in which the author has been involved. 
А eee oe \ gs on the use of projective techniques in attitude study. It would 
бана > Е a pful here if the author had given some guide lines for the coding of 
atitide seg! Па Ға. similar to his excellent step-by-step outlines for the development of 
nil as m dmittedly this is rather a tall order, but there are a number of books and 
ntent analysis which are likely to be of more use to the practitioner than the 


Present account. 
adds Гһе final chapter deals with the quanti 
i ining of a set of standard operating proce 
ex С 
xt supported by a full-page diagram of a blank punch card. 


Once again the 
elpful than a lot 
В. М. Farr. 


fication of questionnaire data. 
dures would have been more h 


1966. Pp.300. 63s. 


people who are concerned 
t when he discusses 


London: Nelson. 
with oe for “ professional test constructors and for the many р‹ 
the mea 1gence and other psychological tests , Dr. Anstey 15 at his bes "erm eed 
applied tent of test items and the many faults that may be discovere: ; y e g m 
E d po analysis. Many examples are provided and their detailed — Г a : 
SVident and perhaps unique contribution. As for the rest, a serious lack of scho arship 

in most areas. The theoretical shortcomings аге too numerous to discuss in any 


lr bri a few pointers will suffice. 
the oa the title is misleading. As Dr. Anstey ackno 
Elenco. is about item analysis and intelligence tests. 
respects E indices for item analy to which a good de: 
methods ыы Гһе availability of computers has r 
terms of Sheree professional test constructors. Moreove соп 
reference their mathematico-statistical properties is а pointless € 
is made to a test-theoretic model. 


Psychological Tests. By EDGAR ANSTEY. 


wledges in the Preface, most of 
Secondly, the discussion of the 
al of space is allotted, is in most 
emoved the need for * short-cut 

г,а comparison of the indices in 
xercise when no explicit 
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Thirdly, there is a strange if not musty flavour to the references quoted on many 
topics. On item analysis we are primarily referred to Burt (up to 1945), Guilford (1936, 
1941) and Vernon (1945, 1948). Оп corrections for guessing in multiple-choice tests the 
main reference is to Davis (1951). (It is not good enough to say that sufficient has been 
written on this topic and then to give an inadequate and even misleading summary of the 
problem). On reliability we are referred to Adkins (1947), Guilford (1936) and Thorndike 
(1951). Оп assessing the value of tests, the important contribution by Cronbach and 
Gleser (1957) is not discussed although it is listed in the Bibliography. 
regression we are referred to Weatherburn (1946) and an unpublished note by Maund (1951). 
On the test-theoretic definition of reliability, Thorndike (1951) is credited rather than 
Spearman, Kelley or even Gulliksen. 


г ! even с On item analysis by computer only Ziegler (1954) is 
mentioned, T he brief index lists Socrates, Inigo Jones, decor of office premises and 
similar oddities, but not Spearman, Thurstone, Gulliksen 


š , aptitude tests and test utility. 
АП у all, Dr. Anstey has a good eye for the practical points, but the scholarship displayed is 
weak. 


Рнплр Levy. 


On multiple 
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COMPUTER PROGRAM ABSTRACTS 


Program for Fact i y M. WYATT, 

А ip олда oe анч Кы M. Wyatt, Department of Education, University 
escription ‘Thi rr: саггі іпсі 

жара ae eq Va eie Peine „ра тас са Тарт үзді a 
simple structure. The intercorrelati f th licae factors ake елесті e а 
оо а elations of the oblique factors are analysed for higher order 
е. аа ru t order factor being given in terms of the original variables. "Тһе 
Tx к he program may be either correlations or raw scores. In the latter case 
^ stanc ard deviations and correlations are computed and printed out. ! 
‘he maximum number of variables acceptable to the program will be approximately 


2 
120 to 130 on the Manchester University Atlas. 


паа a Data Format The program has been designed for use on the 
š аа r sd Atlas and is written in Atlas Autocode. The program is on 7-hole 
ноба roa ata input may be either 7- or 5-hole paper tape or punched cards. А ааға- 
“ee stem is incorporated in the program for use when starting from raw scores, а 

ject's scores being deleted from the computation if they are found to be at fault, ‘The 


nee of the fault and location are printed out. 
The program will analyse a number of separate set: 


at the commencement of the data. 


s of data, the number being given 


d in one minute. For 
les requires approx- 
5 variables requires 


iar glad Times Most average size analyses are completed 
ГАК. ae of a data matrix of 100 subjects scored on 30 variab 
Se sec. Analysis of a data matrix of 200 subjects scored on 4 
i imately 90 sec. 
er Program tapes and directions for use are available from the author at the 
T artment of Education, University of Manchester. 
For use of the program on the Manchester University Atlas, enquiries should be 


ad : : : : рл. 
"Mis ice to Head of University Computing Service, Manchester University, Dover Street, 
Manchester 13. 


Reference 
HENDRICKSON, A. E. and Wurte, Р. O. (1964). Promax: A quick method for rotation to 
oblique simple structure. Br. J. statist. Psychol. 17, 65. 

i FORTRAN Program for Canonical Correlations. By S. HUNKA and M. WAHLSTROM, 
D University of Alberta, Edmonton, Alberta, Canada. 

» escription This program performs a canonical correlation analysis of tw 

he described by Cooley and Lohnes (1962) and Anderson (1958). | 
meee and Configuration The program is written in FORTRAN IV for use on the 
th M 7040 having 32K core storage. No tapes are used. Variable formating 1s used and 
Tj total program is composed of 13 subroutines. | 

Че Three problems having 170 observations, each with a tot 

red 1 Š о ame A 
aia of computer time. This estimate 


го sets of variables 


al of 11 variables, re- 


includes calculation of correlation 


e either raw data or the appro- 


may b 
able for re- 


must be on cards, but r | ‹ 
broutine is made avail 


ation su 
f variables. 


In : 
Mi ive, Options, Output Input 
* correlation matrix. A data-transform 


arrangj à 
ging raw data to form the required sets 0 
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у; raw data аге used, the means 
Various options are available for the data to be output. If : data are used, th an 
dard deviations ani i trix are o Regardless of whether raw data or a 
stan iati d correlation matri utput. d eu dat r 
correlati H the partiti i t. In addition to the deter- 
i ix is i t, th titioned matrices are output. i 
on matrix 15 Input, t E ad á RU 
minant of the correlation matrix being output, inverses of the partitioned TE PE апа 
intermediate calculations required to make symmetric the matrix for which roots and oe e 
are ге i - i re eigenvalu eigenvectors are found 
i ¿ tances where eigenvalues ani 1g 
quired are also given. In all ins е с veg Sa "a 7 
they may Бе outpu anonical correlations and both sets of normalized weights are 
е! b tput. C. 


output and properly labelled. Bartlett's Lambda test is used to test for the significance of 
the canonical correlations. 


Availability Copies of the source de 
Division of Educational Research Se 
Edmonton, Canada. 


ck or listings may be obtained by writing to the 
rvices, Education Building, University of Alberta, 


References | 

Coorzy, W. W. and Lonwzs, P. R. (1962). Multivariate Procedures for the Behavioral 
Sciences. New York: Wiley. а 

ANDERSON, Т. W. (1958). An Introduction to Multivariate Statistical Analysis. New York: 
Wiley. 


A FORTRAN Program for Nonparametric Statistics, 
of Alberta, Edmonton, Alberts 


а, Canada. 
Description This program can 
described by Siegel (1956) and by the Application 
Scientific Subroutine Package. Тһе follow 
(1) Rank correlation coefficie 
p or Kendall’s т). 
(2) Any one of the following: 
(а) Chi-square test between any two variables, categorized at Previously 
Score points, 
(b) Cochran’s Q-test betsy 
(c) Mann-Whitney U- 
(d) Friedman’s Two 


By J. E. Cartson, University 


be used to calculate various nonparametric statistics as 


Directory of the IBM System /360 
ing statistics can be calculated in a given run: 


А x À à , 
een all pairs of variables (either Spearman’s 


specified 
een dichotomou: 


S variables (minimum of three variables). 
test between two s 


Way Analysis of V. Pecified groups оп each variable. 
.. "ay Analysis of Variance between matched groups. 
(e) Kruskal-Wallisan alysis of Variance between Specified groups on each 
variable, 
Wilcoxon Matched-Pairs Si š irs of 
gned Rank Т Қағып ^ pairs O 
variables, est between all possible pa 


/ депи ine Package 
: SRA " pus Scientific Subroutine eu 
WTEST, RANK, TIE Akne A NK, CHISQ, QT EST, UTEST, ‘TWOAV (modified), 
Language 
ІВМ 7040 сш >. K So s наны ің FORTRAN IV for use оп an 
i j 5 of core memo кка: ; ired. 
pin ы 5 уагі ч n tcl шіге 
n; num of 200 subjects and 45 variables can be handled by un ше йз кестелі form: 
variables and 35 to 40 bj the machine and the amount of data. Fifteen jobs of three 
5 Subjects were run in less than 23 min. оп a machine with off-line 


— — 0€ 
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Input, Options, Output The input сап be from cards ог from magnetic tape and data can 
be either ranked or unranked (a ranking subroutine is included), Transformations can be 
made on variables, and new variables can be generated as functions of input variables by 
adding appropriate FORTRAN statements into a data-transformation subroutine. 

The data are preceded by a title card, a parameter card specifying the amount of data 
and the types of statistics desired, and a variable format card. Certain of the significance 
tests require an additional parameter card to be read after the data. 

Any number of jobs may be stacked on a given run and execution is terminated by a 
blank card following any job. ‘The output varies with the statistics desired. I 
zlvailability Listings and copies of the source deck are available from the Division of 
Educational Research Services, University of Alberta, Edmonton, Canada. 


References 

SIEGEL S, (1956). Nonparametric Statistics for the Behavioral Sciences. 
McGraw-Hill. 

ІВМ “PECHNICAL PUBLICATION 
Scientific Subroutine Package. 


New York: 


DEPARTMENT. Application Directory, System/360 
New York: White Plains. 


Factor Scores Programs. Ву Н. J. HatwortH and ANN Ввевмев, School of Edu- 
cation, The University of Birmingham, and College of Education, The University of 


Calgary, Alberta. 


Description А program using the regression method to estimate factor scores forms part of 
а system of programs already abstracted in the Computer Section (Hallworth and Brebner, 
1966). The present program is complementary in that it employs certain formulas for 
component scores derived by Kaiser (1962). It may therefore be used to obtain scores 
from Principal components, or from components which have been rotated to either an 
orthogonal or an oblique position. | 

‘The maximum number of variables and components for current versions of the program 
is 50, but the number of subjects whose data can be analysed is virtually unlimited. 
able in Autocode and FORTRAN versions 
For the Autocode programs, 
aper tape with standard 
input of both program 


Computers, Languages Тһе program is avail 
Suitable for running on the KDF-9 or Atlas computers. 
Input of program and data may be by means of either 5-hole p: 
Autocode characters, or by cards. For the FORTRAN programs, Ps 
апа data is by cards. АП current versions use а variable format facility. 

FORTRAN versions of the program will also be made available for the IBM 360 series 


of computers. 


ин Format АП versions of the programs аге a 2: 
data being indicate Ë inning of a run on the computer. | : | 
he words COMPONENTS, o ORTHOGONAL FACTORS, or OBLIQUE 
Further control words 


.. The words COMPONENTS, or ied 
FACTORS are used to indicate the type of scores required, Epis nn] 
regulate the type of output: scores may be punched on tape oF eor tig the means and 

It is always necessary to input a matrix of principal pergens subjes seoves on 
Standard deviations of all variables for which scores are required, ап ese ii dio required 
these variables In the case of rotated factors, a matrix о аайаркот сс lations between 
For OBLIQUE FACTORS it is also necessary to use the matrix of correlation 


the fact 
ors, ра 
те of all the principal components of 


The program does not require the u 
Matrix, 


1f-regenerating, the number of sets of 


a correlation 
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Computing Times 


Times will vary considerably according to the language and computer 
used. 


In most cases the actual computation time will be relatively insignificant and the 
over-all time will be dependent primarily on the amount of input and output required. 


Availability Copies of the program tapes, card decks, listings and a full description of the 
program are available from the authors. 


References 


HALLWORTH, H. Jk and BREBNER, ANN (1966). Systems of computer programs for 
multivariate anaysis. Br. 7. math. statist. Psychol. 19, 137-138. 
Kaiser, H. F. (1962). Formulas for component scores. 


Psychometrika 27, 83-87. 
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Absolute brightness thresholds are а suitable experimental situation. for 
testing theories of temporal integration. Both the physical and the proximal 
stimulus can be described mathematically and this is the situation where two 
of the complicating factors normally involved in studying the visual system— 
adaptation and lateral inhibition (see Barlow, Fitzhugh and Kuffler, 1957)—can 
be most safely ignored in the early stages of theorizing. In addition, it is better 
understood, both physiologically and psychologically, than most perceptual 
situations. 

One objection to the use of absolute brightness thresholds is that the 
traditional explanation of Bloch’s Law—that the threshold energy of a flash is 
independent of its duration for flashes lasting less than a critical duration—is that 
it results from integration of information in the retina (see, e.g., Graham and 
Margaria, 1935). Since Bloch’s Law is the best known phenomenon of temporal 
integration in vision, this indicates that temporal integration in this situation is a 
peripheral effect and so may not be typical. ‘This objection is not, however, 
very strong. In the first place, temporal integration in other detection situations 

might be peripheral as well. Secondly, suggestions have been made quite 
frequently in recent years that in this and similar visual situations the locus of 
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assumed that the transmission lags from absorption to the accumulator are 
variable; so removing the over-restrictive assumption of Shallice (1964). The 
distribution of transmission lags will be called the lag distribution. For most 
of the paper the lag distribution will be considered as a confounding variable 
obscuring the effect of the integrating mechanism. In the Discussion, Black- 
well’s theory, where the two roles are reversed, will be considered further. 


2. CHANGE Detector MODELS 


Before discussing the models, the assumptions common to all three will be 
Specified. Figure 1 gives a flow diagram of these common properties. Some 
of these assumptions are made to fit widely supported physiological theories; 
Others, on issues where there seems little or no physiological evidence, so as to 
make the models as mathematically tractable as possible. As in all model 
building it is necessary to compromise between usefulness Se 
too many assumptions make prediction very difficult and corroboration ws y 
Meaningless, too few make corroboration impossible. Here, most of the 


assumptions err on the side of simplicity: greater specificity would require 
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greater physiological knowledge. Some of the predictions made will not gepard 
on all of these assumptions and the relation between the assumptions and the 
predictions will be clarified in the Discussion. | | 

1. In the transmission of information from the stimulus to its internal representation, 
temporal integration occurs at one level only. . 

This precludes, for instance, a system in which a retinal ganglion cell 
integrates input and its output is integrated by a higher mechanism. 

2. Each photon arriving at the eye has an equal chance, v, of being absorbed and so 
triggering an input to the accumulator of a constant number of impulses in a short 
burst, after a lag which is independent of the lags of other inputs. 

For simplicity it will be assumed that this constant number is unity, since 
this makes the lag distribution easier to conceptualize; the deductions to be made 
are, however, compatible with any constant. 'lreisman's (1964 b, 1966) theory 
about the variability in burst number will be discussed in Section 10. 

Тһе idea that the absorption of a single photon іп a rod can lead to neural 


activity is a necessary consequence of the statistical arguments of Hecht, Shlaer 
and Рігеппе (1942) (see Brindley, 1960 b). 


3. An accumulator receives input only from the field 
Decisions are based on the output of individual accumulators. 

This is to account for Ricco’s Law (see Brindley, 1960 b), and for the fact 
that Bloch’s Law breaks down more sharply for larger fields (Bouman and van 
der Velden, 1947; Barlow, 1958; Baumgardt and Hillman, 1961). This can 
be explained by probability summation over different accumulators (see Section 
9). 

4. In addition to the photon-produced impulses arriving at an accumulator there are 
assumed to be noise > impulses—probably produced by the spontaneous degeneration 
of rhodopsin. These are indistinguishable from the photon-produced impulses and 
occur at a rate corresponding to x absorbed photons per accumulator. 
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; y psychophysical results (Barl 1 57; 
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This model gives the prediction that if one does not know when the ‘om 
will occur the chance of detection will depend, to a close approximation, on Ақ 2 
the energy in the flash and not on Ив duration. This is not the case, so 
model will not be considered further. 


3. THE CONDITIONS FOR TESTING THE MODELS 


Experimental work on temporal summation and absolute brightness oa 
olds may be divided into two parts according to the length of time the үл 
or flashes last. First, there is the work done оп short flashes mainly to tes 


Bloch’s law of complete temporal summation. Experiments since 1940 have 


included Bouman and van der Velden (1947), Long (1951), Bouman and van 


den Brinks (1952), Davy (1952), Rouse (1952), Barlow (1958), Baumgardt and 
Hillman (1961), Blackwell (1963) and Sperling and Jolliffe (1965). Second, 
there is the work on the detection of long flashes, say longer than one second. 
Although a number of experimenters have worked on flashes lasting up to about 


one second, the only work on long flashes known to me, which is sufficiently 
detailed to be of use for testing quantitative predictions is that of Bouman anc 
van der Velden (1947). 


The reason for making this division into short and long flashes is connected 
with the difficulty of dealing with the lag distribution. If the flash is long 
compared with the spread of the lag distribution, 
lag was assumed independent of all other transmis: 


input becomes a Poisson input to the accumulator 


except at the beginning and 
end of the flash duration. 
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others used either the method of limits or the adjustment method, which are less 
satisfactory (see Osgood, 1953), but it is not obvious how systematic alterations 
to the threshold could occur. 

If, as Barlow (1956, 1957) suggested, absolute brightness thresholds should 
be considered as signal/noise discriminations, then the subject could lower his 
criterion at the appropriate time, if he knew exactly when the stimulus was due. 
This has been shown to occur with absolute thresholds for electric phosphenes 
and with difference thresholds for both light and sound intensity by Howarth 
and Treisman (1958) and Treisman (1964 a). If this effect occurred for absolute 
brightness thresholds and if the subject knew exactly when the stimulus was 
due to start, a subject could lower his criterion for a short flash and so distort 
the comparison of short with long flashes. Bouman and van der Velden give no 
details of their experimental procedure, so it is not possible to assess whether 


this artefact could have affected their results. а | . 
A third possibility suggested to me by A. H. Gregory 1s that if the subject 


does not fixa xation point rigidly, all the light from a flash may not fall 
on the same di us um Then the threshold may be raised as ere 
may not fall in the field of the same ganglion cell. It is known -— a d 
complete areal summation occurs up to at least 10’ in the periphery co ш 
and van der Velden, 1947; Barlow, 1958; Baumgardt and Hillman, у $ 
that only eye movements of that order will affect the results. However, à = 
Armington and Ratliff (1954) showed that when subjects are fixating, = aie 
ments are much smaller than this. Іп а 500 msec period, the image һа 
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50 рег cent chance of moving more than 2. Since summation times are of я 
order of 100 msec, fixation difficulties should have а negligible effect on the 
results. However, for foveal thresholds where areal summation is over a much 
smaller area (see Brindley (1960 b)) this artefact could be important. This is 
one reason why this paper deals only with peripheral thresholds and, except 
in Section 9, with flashes of very small area. 

Experiments on the effect of duration on the absolute threshold normally 
show their results in the form of a relation between the threshold energy (N= IT) 
and the flash duration (Т). This curve, which will be called the N/T curve, 
typically shows an upward kink in the region of 100 msec (see Figures 2 and 3); 
d*N|dT* is high in this region. This kink is equivalent to the breakdown in 
Bloch’s law with increasing duration. Since the interpretation of short flash 


уг value) 
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Каша 3. Baumgardt and Hillman’s results for the relation between flash intensity and 
duration for small diameter flashes (3’ 26”. Note that the intensity is plotted 
Increasing downwards. (From J. орг. Soc. Amer, 1961, 51.) 
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section. - For the GMA model no reasonable explanation of the kink has so 
far been produced, as will be shown in the next section. 

Since the complication introduced by the lag distribution does not affect 
long flash predictions, for this condition the models will be examined to see how 
closely they predict existing frequency of seeing curves, which is a much more 
stringent test than that of explaining the short flash kink. 


4. Ѕнокт Frasu THRESHOLDs—GMA MODEL 


It has previously been shown, using a continuous analog 
(Shallice, 1964), that it is not possible to explain the short fl 
assumed that the lag distribution has zero variance. 

This may be because the GMA is not a suitable model for visual thresholds 
or just because an unreasonable value was chosen for the lag variance. In this 
section, how the GMA behaves when a more realistic value of the lag variance is 
used will be examined in order to assess whether the previous result was just a 
Consequence of the unreasonable lag variance used. This entails making some 
estimate of the lag distribution. To do this the theory which has been of most 
help is that of Mueller (1954). He proposed that every absorbed photon 
immediately changes one molecule of the absorbing substance to a second 
substance. This second substance subsequently goes to a third state and in so 
doing triggers an impulse in the bipolar cell and hence the ganglion cell, provided 
It is not in a refractory state. Mueller assumes that this second change of state 
is analogous to radioactive decay, in that the time between the first and second 
Changes of state can be described by an exponential distribution. He shows 
that the theory can account for previous ganglion cell recordings of the Hartline 
(1934) type without assuming, as Hartline did, temporal integration 1n the 
T ommatidium. on 

ueller's theory satisfies the present axioms anc 1 sugges : 

the variance is due А delays Aol a the system, then the lags can В be of 

у an exponential distribution plus a constant (т). Using this as E z ed 
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Now Gregory (1963) has shown that for a semi-infinite flash starting at 
ф=—т: 


ІМй-а-(а-а) севә, (3) 


ge number of photons being absorbed before and өле 
is Mueller’s constant. In our example 4-0, q=», 


where qo, q, аге the avera 
the flash started and с 
therefore 


2 = x vI(1—e-ct) — by 

so that 4) 
у= (*vI)[b —vI е-“|(Ь—с)+ K есы, ( 

Where K is a constant. 

The steady state 


evel of the accumulator with no flash acting is given 
when #—co and 1-0, 

ie. y=x/b. 
when the flash is about to Start, y=x/b. Hence from eqn. (4), 


penso + E) 
whenz=0. So that generally, 
y (x--vI)|b —vI(b e-ct с e~ bt) /b(b — c). 
Now if the flash lasts for a time T, then until = Т-т, eqn. (3) describes 
the generation of impulses at the receptors. Also, 


since the second term 
decreases with increasing 7, the maximum value of у(4) in the range 0<t<T is 
attained at z= Т. Therefore 


Therefore, 


(5) 


HT) = (x +1) /b—vI(b eer ç e-b?)/b(b — o). (6) 
When і>(Т-т), we are interested in the out 
flash has ended. i i 
D) + (а —Ф)(1—е-ет) сеен, 
From eqns, (1) and (2) we have 


z =x+vI(1 -ег-е?) ect Бу. 
Therefore I= — a/b +01 6797) е-е (b — c) 
where К” can be obtained from eqn. (6). 
Hence у) = a/b еса) e-etje_(y _ ет) ett [bc (b — c). 
Now for detection, Ly(t)]max > C, a constant, 

By differentiation it can be shown that Ly(t)Imax occurs when 

(1 eer) e-ct — (1 =r ba) eb. 

Das ајда e tT) 0-0 b(1 — е-ътусгф-), 


T K' е-и, 


Therefore 
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If I and T are vari 
varied so th i 
id NT) shold Le at [»(Z)]max is constant, the relation between Т 
where у= C [b ВИИ Eda) e O a 059, 
= C —x[b is constant and N i 

Do кү) ів cor is the flash energy. 
ИР ea pendis 1 it will be shown that whatever eaka of E, b i 

n = a good fit to empirical N/T curves. рае: 
Pm безе suggests that the previous finding that the GMA model 
Fea оқ es NJ SH curve was not just a result of choosing an на ые 
аа 4 beer ы since the present analysis is not based on the proper 
eae d ut on a continuous analogue of it, the evidence is not 


5. SHORT F 2 
RT FLASHES—PERCEPTUAL MOMENT MODEL PREDICTIONS 


In i ; š 

ieee ы. ы уйа кышы ш while it is obvious that the perceptual 
are features of the N p let a kink in the N/T' curve about where T=D, there 
(рёжопы] conim V| curve which are hard for the model to predict. U. Neisser 
peedtered em th unication) has pointed out that Bloch’s law itself cannot be 
the familiar ty perceptual moment model. He says, “ It seems to me that 
only if one coul I would be produced by a quantal (moment) mechanism 
But this can e sure that cach flash began at the beginning of a moment. 
linearly dime be ensured, hence the moment (instant) of incidence must be 
саве there will pus across the quantal interval. It is easy to show that in this 
the ares ren ea parabolic relation between the duration of the stimulus and 
reached. "Th ergy it delivers which does not level off until two quanta are 

e observed linear relations are thus refutations of the moment 


theory,” 

Bloch’s law holds exactly it seems 
1 moment model. However, the 
ivocal. Some experiments find 
Before reviewing the evidence, 


This is a v 
to be a poate very strong argument and if 
experime ecisive refutation of the perceptua 
i ntal evidence on this point is not unequ 


1t holdi 

3 i i 

iË S е” amiy others only approximately. 
advisable to obtain an estimate of the perceptual moment model 


Predicti ^ 
extend iv, Neisser’s parabolic relation holds for average energy, but to 
Variance and ney thresholds requires two assumptions—that there is no lag 
unction, It ү for any length of flash the frequency-of-seeing curve із а step 
^E variance ; as not proved possible to obtain a quantitative prediction when 
assumption is allowed for, but it has proved possible to relax the second 
I a 
lator ge is little or no lag variance then, considering t. 
together ; à very short flash, the impulses should arrive at the a 
'Mpulses т time and hence most likely within one moment. 
сё Mene RU within any moment has to be greater than а 
Blven inte on, but the number of impulses arriving during th 
nsity of flash depends upon the Poisson variability o 


he input to the accumu- 
ccumulator close 
The number of 
certain criterion 
is moment for а 
f the number of 
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i iability of the dark noise. Hence, as 
bed and the Poisson variability o ° 1042), 
a «ан es reasons, and tested by Hecht, Shlaer = ЕМЕСЕ 2 
i dis Velden (1944) and Barlow (1956), = poc | о bus вя а унй 
i е 4). 
hort flashes should be Poisson sums (see igure 4 24 
da pueri receives further support from an argument in Section 


тот 


ы o Ф 


Frequency of seeing 


"m 


Та 16 1:8 20 22 2-4 
Log N 


irenne (1942) (dots) and van der Velden (1944) 


Ficure 4, Data from Hecht, Shlaer and Pi E 
d by theoretical curves from Barlow's theory. (From F. opt. Soc. Amer. 


(crosses) fitte, 
1956, 46.) 


› it will be assumed that the freq 


i uency-of-seeing curves have 

the following form (where B, C are Constants) 

0 iflnN<c 

ln N-c : 
9 B-C ifC<InN<B, 

1 if B< Inn 
That this linear relati erat! nction 
can be seen from Figure L Шы. Approximation than a step fu 


Не ГТ 
1 I an initia] estimate 

of th 
assumed that the flash is not detected ; 


arts 
ume that the larger of the tod 
е relation between N and T it 
n the smaller of the two parts. 
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probability of the fl i 
ash b i i 
of Big ber won е аена in the larger part is that of a very short flash 
In(/2-C . 
н “Foe if С< (1) <В. 
Now for а flash of len 
== dor gth T, where C «In (IT) « B, the ch i i 
n, taking into account the variability in the vat ter of пала p Mais 
Ox D-T [In (IT)— C] " 2 Тіп (0) -С 1 
Р p= p: 25 Bh ss (2 
ere X = тах[7/2, exp (C)/1]. | 
Now if In (IT)> C+In 2, then Х=Т/2 and from (7): 


р- 
Tin (IT)-C] = » [t In (It) -t С". 


| e-go- 
T 
inn (B-C)Q=In (IT)-C-T(1 = 2)/D. (8) 
ence for constant frequency of seeing, the following relation must hold: 
(9) 


log IT=K+0-13T/D, 


Where K is a constant. 
This relation holds for 0 < T < D provided: 
Ë (i) Іп N<B when T->D and (ii) In N—In2>C when 7-0. 
ae the results of the experiments mentioned above, (B— C) seems to lie 
ughly between 0-6 and 0-8 log; units. 
Considering relation (ii), 
In ([T)- C» (B-C) if O>4 
>In 10x 0-6 
=In 10 x log 2 (using log 2—0-3010) 
=ln 2. 


Hence relation (ii) holds provided 0>05. 
n (IT) - 0-13 In 10. Therefore, 


From eqn. (8), C+(B-C)Q=! 

In (IT) - 0-13 In 10- Bg « (B-06 In 10(1-0) 
-В-061һ(1-0). 

_Q) > 0-13 In 10, i. Q<0-78. 


; d ly. Since 
between 0:5 and 0-78 approximate y. 
5 the 50 per cent threshold and Bouman 


nt threshold, this range covers all the 


S 
Ps In (IT) « B if 0-6 (1 
жы» both relations hold if О lies 
and method of limits presumably gives 
Nor: van der Velden used the 60 per ce 
mally used criteria. 
for КЫ are five modern experiments 
results a. small area peripheral flashes. ^P, 
Whether í) presented graphically in a a = vi < T<D 
or eher the moving average model Ie Қаблан. негі WH 
qn. (9) fits best. Another (Rouse, 1952) was principally cone 


[T curves 
1958) the 


known t 
ble to tell 
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for 0<i<n, of the rth moment of the distribution of the time to absorption 
from state 7, then: 
(I-T)m® =)... -(—1)0*9mO. 
Let р be the rate of input to the accumulator (=/1 ) апі Л be the rate of 
decay, which is equivalent to the death rate. Then, if k is small, 
(I-T)/h= 


0 
ш —p 0 v. dto T Я 

=} (+p) —p 0 - oA Ж . 0 
D A р po. . à 0 
Ü 2 Ü =G- pte- p 
0 ы” 


-(n-1AÀ p+(n—-1) | 
с 


Consider Т” = F і an (п+1) x (2+1) square matrix, where C is the 2 x1 


column vector [0,..., 0, y]. Now T’ is a finite irreducible Markov chain with 
only one absorbing state and no other persistent states. Hence the probability 
of absorption = 1 (Feller, 1957). 


"Therefore, using Pike's formula for the mean absorption time, 
(I- T)m9 =m — 1. 
If U; is the mean number of units of time, 


h, from state ; to absorption, 
0,-0,-1; 


: | (10) 
0а ++), uUa y —1 (l<i<n—2); (11) 
7 (17 AUG. a (a (n— 1))Us y= 1. (12) 
For the first (n — 1) equations, the difference eqn. (11) together wi 
condition (10) has the solution kaa ukasa TEN 
E M e on 
imo POR (деті 
Define U; by the equation. 
“(4-1 aol +4]U р, (4% 
. А (4-1) HUn 
Then U, satisfies the difference eqn. (11). Therefore 
Uy=U, "OM sa ?! 
But from the ath j-o KO SH 
ut trom the nth eqn. (12) as well as eqn. (13), we have Un=0. Hence 
ai М ач „ 
U fi = Ti 
10) д P [riy as h0 (14) 
Геп Hoopen (1966) has obtained by 


ә” 


Temporal Summation and Absolute Brightness Thresholds 145 


present case »=vJ+x where v is the proportion of arriving photons absorbed 
by the rods, x is the rate of production of ‘ dark noise’ impulses and А is the 
unknown constant of the GMA process. 

Тһеге are two reasons why it is not possible to use this relation to test the 
GMA. First, consider how the distribution relating frequency of seeing to 
flash duration varies with changes in the intensity of the flash. Eqn. (14) shows 
how the mean of the distribution should vary, but all the experimental results 
show how a particular percentile of the distribution varies and one cannot be 
deduced from the other. An exception to this generalization is provided by the 
frequency-of-sceing curves found by Bouman and van der Velden (see Figure 6). 
However, their curves show how, for fixed duration flashes, flash energy and 


100 TWIN, t, d) 


2 5 10* 2 5 105 2 
Қ -of-seei for 
Р der Velden’s results for the frequency: of-seeing curves 
кйш 9 Бошай аа P мді (d), durations (4) and energies (М). The thin lines 


саре of edictions from their two quanta theory. (From 7. opt. Soc. Amer. 1947, 
are e pr 
37.) s.p. 


K 
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hence flash intensity is related to frequency of seeing, but what is required is how, 
for fixed intensity, flash duration is related to frequency of seeing. This 


could be roughly estimated from their results, but the mean depends upon all 


parts of a distribution, including the tail, which cannot be satisfactorily estimated. 

The second difficulty is that, even if a curve could be found relating the 
empirical flash intensity to the mean flash duration mentioned above, to assess 
whether eqn. (14) was a satisfactory description of it would be no simple matter. 


The equation contains four unknown constants v, x, A and z so that, even ifa 


reasonable fit were obtained, it is doubtful whether it would be particularly 


meaningful, unless some alternative method could be found of estimating the 
constants. 


7. Lonc FLASHES: MOVING AVERAGE PREDICTIONS 


It has not proved possible to find an analytic expression for the chance of a 
long flash being detected on the moving average model. ‘This is not surprising 
as the moving average process is notoriously intractable (see Naus, 1965). 
However, it is reasonably easy to simulate the process and since the basic 
parameter—the period—can be estimated from short flash results, the simulation 
does not take too long. Only three constants need be varied in the simulation: 
T, the length of the flash, а the average number of impulses arriving at the 
accumulator in a period, and т the criterion. Using Р- 100 msec, M(a, n, Т) 
was obtained by simulation where М(а, n, T) is the chance that a flash of 
duration T, average number of impulses arriving a, and criterion п would be 
seen. For given values of N and Т from Bouman and van der Velden's results, 
the frequency of seeing Q can be found (see Figure ба). For these values of 
Q and T and for a fixed value of the criterion z it is possible to find a value of a 
such that 

OQ - M(a, n, T). 
Let My? (Q, T) be the value of a which satisfies this equation for the previously 
fixed values of Q and T. Now from the axioms, a=(vJ+x)P where v, x, P 
are constant. "Therefore, 
n (0, T) -(I--)P. 

The value of J for particular values of Q and T is given by the data and therefore 
this equation can be tested for each point on Bouman and van der Velden's 
frequency-of-seeing curves for any given value of n. Figure 7 shows the plot 
of Муг! (Q, T) against I for 2, 4, 6, 10 and 15 using Bouman and van der Velden’s 
data for the 1 sec and 5 sec flashes and the 60 per cent points for the 1 sec and 
не 27 It can be seen that a reasonable linear fit is obtained, especially 


„А second check on the model can be obtained by comparing the obtained 
straight line with deductions from very short flash experiments. Define the 
tail of the lag distribution as the area not covered by the central 100 msec. If 
the lag distribution has negligible tails, all the impulses from a very short flash 
will contribute together to the moving average; there will not be time for the 
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^ Moving Average 
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ғ 
T 
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ulse Arrival Rate 
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Light Intensity 


FicunE 7. Straight line predictions from the moving average model for different values 
of the threshold, n, fitted by eye to Bouman and van der Velden's long flash data. 


to have been in the moving average for more than 100 msec 
before the last impulse arrives. Ifa is the average number of impulses arriving 
at the accumulator from the flash added to the average number of noise impulses 
іп a period апа Ра(и) the probability of its being seen with z the criterion, then: 


Pa(n) = > ау e-a[yl. 


first of the impulses 


yn 
"Therefore dP|da = е“ an-1|(n— 1. 
So Pa(n) = J “eu un ](n — 1)1 du 
0 
-Га(т)/Г(а), 


where Га(л) is the incomplete Gamma Function. 
The incomplete Gamma Function has been tabulated by Pearson (1922) in 


the form G(s $2) where s, =ају/п and s,=n—1. For апу value of О and fixed 
a can be found such that Q=P,(n). Let G*(Q, 1—1) be this 
value of a. Consider first С-Қ06, п-1). From this, using the values of v 
and x obtained from the slope and intercept of the long flash line, the energy of 
the flash giving the 60 per cent threshold for a very short flash can be estimated 
from a=vN 4 «P. For n —2, 4, 6, 10, 15 this is found to be 52, 46, 38, 36 and 

Тһе empirical value obtained by Bouman and van der 


ectively. 
vds gc ide m was 31 quanta. So the prediction for the 60 per cent 
threshold is fair for the higher values of 1 but poor for the lower values. 


n, a value of 
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Instead of considering just the 60 per cent threshold it is possible to consider 
the frequency of seeing curve for very short flashes. Figure 8 shows G-1(Q, n —1) 
and M (Q, Т) both plotted against J for n=10. Since both lines should have 
intercept x and slope v they should be the same. They are obviously not. Thus 
the moving average model for higher values of z is not satisfactory. 


^ Moving Average (n=10) 
1 
| 


-r => 
10 20 30 40 50 
Light Intensity ` 
Ficure 8. Impulse arrival rates derived from Bouman and van der Velden’s long and 


short flash frequency of seeing curves usin i 
r ‹ g the moving average i - 

The short flash points should lie on the straight line which has yl DNE 
through the long flash points. SOE 


а. values of 2, say n=2, the difficulty for the model is that the 
oe Бу ше in a short flash is much lower than would be predicted 
pelle g - results. If the opposite had been true it could be explained 

y ing that the tails of the lag distribution could not be неет but 


there is no obvious ex i 
planation of the actual result. Alteri i 
. . I £ 
not er rien since then the position of the kink i ee 
impossible i 

р to explain. (It would be necessary to assume iod of i 
mately 35 msec in order tte tap Hen 


С — 
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be less steep than the short flash line in Figure 8. It therefore appears that the 


moving average model cannot provide a satisfactory explanation of long flash 
results. 


8. Lone FLASHES: PERCEPTUAL Moment MODEL (Poisson МЕтнор) 


For the moving average model, the probability of a long flash being seen 
was used to estimate the average rate at which impulses arrive at the accumulator 
and this should be linearly related to the intensity of the flash. In addition, 
results from long flash experiments could be compared with results from short 
flash experiments. A similar approach can be used with the perceptual moment 
model and proves much simpler since analytic methods can be used throughout. 

On the perceptual moment model a long flash has an equal and independent 
chance of being detected in any moment during its time course. If 0, T, a are 
defined as in the previous section and Ра(п) is the probability that a flash 
producing average input a will be seen in any moment, then 


1-Q-[1 - Ps(n)]??7. 


^ Perceptual Moment 
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ival Rate 


n m 
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s т > 
10 20 
Light Intensity 
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9 Straig! ine predictions from the perceptual moment model for d 

FIGURE 2 s «capped n, fitted by eye to Bouman and van der Velden's long flash 
value s M, 


data. 
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However, as before, 

Pa(n)=T a(n) /T(n) = G(a]A/ (n), n — 1). 
Therefore G(a//(n), n—1)=1-(1-Q)?!?. 

Let the value of a satisfying the equation be G-(Q,2—1, Т), then we 

should have 
V(n)G-(Q, n—1, T)=(vI 4-3)D. 

Using the values of Q, T, I used for fitting the moving average model, the 
existence of the linear fit between J and G-1( ) can be checked. Figure 9 
shows the relation between a/4/n and I for n=2, 4, 6, 10 and 15. @/4/n is used 
instead of a as the ordinate because the standard deviation of G-( ) is 


approximately proportional to ул. Reasonable straight line fits are found for 
all values of z. 


As in the previous section, a second check on the model can be obtained 
by considering long flash frequency-of-seeing curves in conjunction with short 


^ 


Perceptual Moment (nz10) 


ise Arrival Rate 
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Pom у Кре arrival rate derived from Bouman and van der Velden’s long and 
аза irequency-of-seeing curves using the perceptual moment model with 


n=10. The short flash points sh i i i i 
epulae i bud Rus "à е lie on the straight line which has been drawn 


open circles. Short flash results are again indicated by 
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flash curves. However, the assumption that all the impulses from a very short 
flash summate in one moment is not so reasonable as for the moving average 
model because, even if there is a small lag variance on some occasions, some of 
the impulses from the flash will arrive in one moment and the rest in the next. 
In the following section, evidence will be presented that this effect is not large. 
Even if it is large, the energy threshold predicted from the long flash results will 
provide a lower bound to the possible values of the energy. That is to say, the 
theoretical value cannot be greater than the empirical value. For n=2, 4, 6, 10 
and 15, the predicted energy for the 60 per cent threshold is 40, 33, 31, 31 and 
30 quanta respectively, compared with the empirical value of 31 quanta. For 
low values of the criterion, the estimate is too high so that it cannot be explained 
using the idea of lag variance. For higher values, the fit is good. Also, the 
lines derived from short flash frequency-of-seeing curves are reasonably similar 
to the lines obtained from long flash curves. An example, for 2 = 10, is shown 
in Figure 10. Тһе lines are much more similar than they were for the moving 
average model. Therefore the perceptual moment model appears to fit reason- 
ably well for a criterion of not less than about 4 impulses per moment. 


9, Томс FLASHES: PERCEPTUAL Moment Mone (Brinptey’s METHOD) 


Whilst the previous methods enabled predictions to be obtained for small 


area flashes, the axiom system cannot deal with larger areas where more than 
Тогеохег, since there is strong evidence that 


one ganglion field is involved. М 
ganglion cells have overlapping fields and are therefore not independent (Hart- 


line, 1940; Brindley, 1960 b), simple elaborations of the previous method do 
not seem possible. ү. 

However, Brindley (1960 a) proved a relation which enables generalizations 
to be made about the effect of duration for large area flashes. Moreover, with 
this method, no assumption need be made about the relation between а and J, 
previously assumed linear. He showed that if the 1/Т curve is of the form 
Тїп = c for a particular frequency of seeing Q = Qo then the frequency-of-seeing 
curves for long flashes, assuming no temporal summation between moments, 
must be of the form Q=1— exp ( —kIm) where k=T/c{In1/(1—Q,)]- This is 
proved by deducing what the frequency-of-seeing curve for one moment must 
be and hence obtaining the curve for many independent moments. 

Brindley (1960 b), also using the results of Bouman and van der Velden, 
showed that the results for long flashes of small area where TI?—c for 60 per 
cent frequency of seeing, produced frequency-of-seeing curves of the predicted 
form. Brindley’s approach can be extended to large area flashes provided their 
I/T curves have an appropriate form. 

From Brindley (1960 a) we should have 


О-1-ехр (—RIm). 


Therefore log In (1—0) = т log T+log k. (15) 
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But for 0-3 < Q «0-9 the following relation holds to a close approximation (see 
Shallice, 1965): 

j i Q= ilog In (1 — Q)1 4- 0-59. 
Therefore from eqn. (15) we should have: 


Q=3m . log [+ Шор k4-0-59. (16) 
i i гі іп the range 

Hence frequency-of-seeing curves should be linear w ith log I in the rat 
0:3 <0 509. This is the case for all areas of flash (see Figure 6). Іп addition, 


the slopes of the (Q, log 7) lines can be measured and compared with the shape 
as predicted from the I/T curves. 


If the N/T curve is of the form NT = C, as it is in all cases to be considered, 

then it has the equation: 

log N=tan 0 log T4 c,. (17) 
T[G-tn 97 — exp (сі). 

Let the Q/N curve have equation: 


О —tan ¢ log N +c. 
Then from eqns. (16) and (17), tan $= т = (1 —+їап 0)-1. Therefore 
cot ф —4[3(1— tan 0). 
Since the N/T curve has probably less error variance than the individual 
N/Q curves which are only frequency-of-seeing curves (see Figure 5), 0 will be 


used to estimate ф which can be measured directly from Figure 6 for both 
1 sec and 5 sec results. 


A comparison of estimated and empirical values of ф 
is shown іп Table 1. 


Hence 


TABLE 1. A Comparison OF THE ESTIMATED AND EMPIRICAL VALUES oF ф 


The tangent of ¢ is the slope of the straight part of the curves in Figure 6, 


using Brindley's method 
Area Empirical 0 Estimated m Estimated $ Emp.d Emp. ¢ 
(1 sec) (5 sec) 
4’ 25% 1:9 54° 52° 54° 
38” 299 22 599 60% 65% 
213’ 36° 38 71* 69% 67° 
1390” 38° 48 749 759 77” 


Тһе fit can be seen to be good except for the 38”, 5 sec curve and possibly 
for the 213’, 5 sec curve and is certainly within the margin of error for frequency- 
of-seeing curves. 


| Finally, it is possible to estimate what the average energy arriving within a 
single moment must be to give a 60 per cent chance of detection. ‘This can be 
compared with the 60 per cent Point for very short flashes to see whether the 
input from a very short flash arrives all in one moment or is spread over a 


ғ 
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number of moments—an issue that was important in previous sections. Let 


P(N’) be the chance of detecting the input from a flash of energy N' all of whose 
input arrives at the accumulator in one moment. Therefore 


(1—P)- (4 - Q7 = exp [- (D]e) (In (1 — Q) 1 Ha], 
so that log In (1 —P) — log N' — (m —1) log D 4-log k. 
Comparing with eqn. (15), we have, when P=Q: 


"nt-— 


log N—log N’ = 1 (log T—log D). 


т 


But from the N/T curve we know that, for large 7: 


m—1 
log N = —— Я 
og N = log Tc 


Hence log N’ = ( - 7) log D-4-c. 


If it is assumed that all the impulses from a very short flash arrive at the 
accumulator within one moment then the value of D can be read off from 
Figure 5. It is where the asymptote of the N/T curve has energy equal to that 
of the shortest flash. From rough measurements from this figure, the following 
estimates of D were obtained for spots of diameter 4’, 8’, 16’, 38’, 76’, 213’, 
663’ and 1390: 81, 86, 97, 132, 139, 86, 107 and 94 msec respectively. 
These estimates agree reasonably well with the value D = 100 msec used through- 
out. Thus the idea used earlier that lag variance may not greatly influence the 


predictions receives tentative support. 


10. PERIPHERAL 'l'HEORIES 
Treisman (1966) has shown quite convincingly that the response in an 
absolute threshold situation is determined centrally. However, it is conceivable 
that temporal summation could take place at a more peripheral stage of the visual 
response determination. Yet, both the plausible places for temporal 


system than | I 
0 occur—in the rods and іп the retinal ganglion cell—seem to have 


summation % 


drawbacks. | Р 
If temporal summation were to occur in the rods, this would mean that the 


output of the rods would be the same for two flashes of the same energy provided 
that both lasted for less than the critical duration. However, it has been shown, 
using statistical arguments, that the perception of a flash at absolute threshold 
depends normally on a number of rods each absorbing a single photon, provided 
that the flash has diameter greater than about 10’ (Hecht, Shlaer and Pirenne, 
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; 1956; Brindley, 1960 b). If Bloch's law holds for this area of 
ra Ta not be aoled by summation within individual rods. In 2 
in the first experiment where the effect of area on Bloch's law was investigated, 
Graham and Margaria (1935) found that whilst temporal summation held up % 
100 msec for 2’ diameter flashes, it only held up to about 30 msec for 1 
diameter flashes. However, later experimenters have found that it held almost 
as well for larger area flashes (e.g. 1° diameter) as it did for flashes of the order 
of 2' to 4’ (Bouman апа van der Velden, 1947; Barlow, 1958; Baumgardt and 
Hillman, 1961). Baumgardt and Hillman have produced evidence that Graham 
and Margaria’s results may be distorted by the use of uncalibrated wedges and 
that such an error is consistent with the results. In any case the weight of 
evidence suggests that temporal summation occurs at a higher level than the rod. 

That temporal summation occurs at the ganglion cell seems at least equally 
likely, especially as it is known that Ricco’s law of complete spatial summation 
holds for fields of about the area served by a single ganglion cell (Bartlett, 1965). 
For a larger area, spatial summation seems most likely to be probability sum- 
mation (Bouman and van den Brink, 1952). This would mean that the brain 
is not combining the outputs of more than one ganglion cell from an eye in 
order to make a decision. "This idea receives some support at least for the 


lateral geniculate nucleus where Bishop, Levick and Williams (1964) found 
that a third order neuron normally recei 


ves excitatory input from only one 
ganglion cell. 

How could the ganglion cell o 
summation? Assuming that it o 
then each impulse arriving from a 
across the synaptic gap which in 
synaptic potential (EPSP) in the 


perate so as to be the sole locus of temporal 
perates in the normal fashion for a neuron, 
bipolar triggers a burst of transmitter substance 
turn produces an increase in excitatory post- 


' ) ganglion cell (see Eccles, 1964). The EPSP 
decays exponentially until the next burst of transmitter substance arrives. If 


just after a burst of transmitter substance arrives the EPSP threshold is reached, 


the cell fires either once or, possibly if the EPSP decay is slow, more than once. 

Now if the EPSP builds up to its threshold v. 
the time the flash is acting, then tem 
must occur at a higher level. 


Kuffler, Fitzhugh and Barlow (1957) 
to be the case, the EPSP would have 
impulse could be produced. Howe 
e.g. Renshaw cells, they are rare (s 

In addition, Ten Hoopen (1 
process in which the approximati 
values instead of being continuou 


have demonstrated in such cells. For this 
to decline very slowly so that more than one 
ver, while cells with slow EPSP decays exist, 
ce Eccles, 1964), 
966) has shown that an immigration-death 
On 18 used that the EPSP takes only integral 
5, provides a satisfactory model of nerve fibre 
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firings. The rise in the EPSP Produced by an impulse arriving along a 
Synapsing fibre is taken as the unit. This model is mathematically identical to 
the GMA so the predictions made from it would be the same as those made by 
the GMA. They have been shown to be rather unsatisfactory in Section 4. 
Thus the retinal ganglion cells do not seem likely to be the sole locus of temporal 
summation. 

More conclusive than these arguments, which depend on extrapolations 
from present-day physiological knowledge, is some psychological evidence. 
Matin (1962) found that if two flashes are presented to corresponding points 
on the retina the chance of detection is higher for inter-flash intervals up to 
100 msec than would be predicted if probability summation alone was operating. 
Whilst, as Matin points out, some of the preceding work on this problem had 
failed to detect summation, this seems strong evidence that some temporal 
summation is occurring centrally. 


11. Discusston 

The arguments of the preceding section make it implausible that temporal 
summation can be explained entirely peripherally. However, the axioms of 
Section 2 are not the only hypotheses consistent with the arguments of the 
preceding section. Three other such hypotheses are those of Baumgardt (1953), 
Treisman (1964b, 1966), and Blackwell (1963). 

Baumgardt suggested that a flash is not detected if several quanta are 
absorbed in a sensitive unit, but only if a number of sensitive units each absorb, 
nearly simultaneously, two or more quanta. This could easily be fitted in with 
the arguments of the preceding section by assuming that the EPSP threshold 
of a ganglion cell is between one and two times the rise in EPSP produced by the 
effect of one quantum absorbed in the ganglion’s field. The integration of 
two-quanta coincidence would take place more centrally. It seems very difficult 
to work out what the predictions of this theory would be, taking into account 
the complications discussed in this paper. Baumgardt only discussed the 
predictions of his theory with respect to spatial summation, so that as far as 
temporal summation is concerned his theory remains at present uncorroborated, 
unlike, say, the perceptual moment model. . 

Treisman suggests that the effective photons absorbed trigger not one 
impulse in the ganglion cells but a train of them. However, Treisman, in the 
development of his theory in order to fit Barlow's (1957) findings, assumes that 
at low light intensities the variability in the pulse trains produced by each effective 
photon is negligible. So the extra complication of assuming variability in the 
pulse trains would not affect the predictions made in the present paper. E 

Blackwell's theory, as discussed in Section 1, is conceptually almost totally 
distinct from all other work in the area with its emphasis on the lag а 
rather than summation. It provides a good fit to his data for short equ es 
in order to fit the long flash data he has to introduce a second assumption whic 
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is equivalent to a moment model. This is because he phrases his age) 
cnc of a continuous variable instead of using a stochastic formulation. 4 E š 
the moment model alone will explain the results, there seems no a. с 
rather stringent assumptions about the form of the lag distribution. y 


In Section 9 no assumption is made other 
than that the Change Detector oper: 


ates on the perceptual moment model. It 
is found to fit the data well. 
In Sections 7 and 8 most of the predictions are critically dependent on the 
axioms and here the perceptual moment 


(1) That complete tained at absolute threshold for up to 
100 msec flashes and in p 
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APPENDIX 1 


Тнат THE GMA Dors Nor Fir SHoRT FLasH Curves 
It was shown that the GMA predicts that the N/T curve for short flashes 
should obey the following equation: | 
(1-е-Әту/0-с) 
на NES Tero where E, 0, c are constant. 
Let b — rc, therefore I 
NOT) _ (1—ecrezyut-) (1-е-ет)(е-1) 
МТ) (eye Х (1 Teen 
( 14 e-renv (r-1) 
i (1+ ecez)me-ay 
(1 “-ағ)У(-1) 
= (1--ау”(-1) 


Empirically, this fraction remains very close to unity and then increases rapidly 
before becoming relatively stable. Is this theoretically possible? 
š N(4T) N(2)T + 
Consider Ex a =, ff dimer 
« an- [vg 1] is - 1] 


= (1 + атум(ғ-1) 1 2 (1 Jar r- T і 


dary — (1--ау74-У 7 


It can be shown by the method of approximations that, for any value of 
7, as al, (Т)->4 and as а->0, £(T)—1. By computer exploration of the 
function it was found that dg/da>0 for 0 «a «1, so that for 0<a<1, &(T) «4. 
Тһе data which has the least sharp kink and is therefore most likely to fit is that 
of Bouman and van der Velden. Using their data and putting 7—50 msec, we 
have g(T)—4-5. Ву itself this would not necessarily mean that their data could 
not be fitted by the model, since the supra-maximum value could be attributed 
to slight errors in the values used. This is plausible since 2(7) is related to 
d?N /4Т? and is not therefore likely to be very accurate. 

However, for the empirical data, we find g(2T)=1 when T=50 msec: that 
is, the N/T curve is linear in the region 100-400 msec. Yet from the model, 
if g(T) is large, 2(27) must also be large. Table 2 shows for a given value of 
&(T), what is the lowest value that g(2T) can take for any possible value of a. 
This makes an explanation in terms of slight errors less likely. Хо doubt by 
slightly increasing its complexity the GMA could be made to fit the data, but 
it is not now an obvious choice for a model of brightness thresholds, 


, Where a—e-c7, 


TABLE 2. ‘THE RESULTS OF A COMPUTER EXPLORATION оғ £(2T) For 
PaRTICULAR VALUES or g(T) 
. 3:5 8:25 3:0 25 
g(T) 3:75 5 
min [g(27)] 3:10 2:48 2:05 1:75 1:37 
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APPENDIX 2 


Tue EFFECT ON THE PERCEPTUAL Moment MODEL or INTRODUCING LAG SPREAD 


Consider two perceptual moment models—M, the one with zero lag 
variance considered earlier, and also M’ which includes lag spread. On model 
М” for a flash of energy Л/у lasting from —7 to —7+h, let the chance of an 
impulse produced by the flash arriving at the accumulator between ¢ and ż + ôt be: 

МАҢ) dt if 0<t<L(L<D), 
à (18) 
0 otherwise, 
where 7 is the minimum transmission lag and ХО) is а non-negative function 
In . . L 
determining the lag spread and satisfying | f(t) dt=1. Hence the input to the 
0 


M' accumulator from the flash will be the same as the input to the M 
accumulator (with zero transmission lag) from a flash whose intensity is 
described by eqn. (18). Hence, since the system is assumed linear, the effect 


of lag spread may be investigated by using the zero lag variance model but 
with a changed stimulus input. 


Consider four possible inputs to model M. 'Two represent the input to 
model M itself—a very short flash (Г) of energy N and a flash of (F) of length D 
a uh N í ae с two represent the corresponding transformed inputs 
о model M’—a flas described b «EE i 
[ies de 3 y eqn. (18) and a flash (F,) described by 


4 
N' NOD, O<t<L; 
X(t) = Ур L«t«D; 
+ Б : 
N OUD рҳ<а<р+г; 
otherwise. 


Let t be the time from 
when the flash s 
and let J(t) be the most ener tarts to the next moment boundary 


moment. Throughout this secti 


ergy moment and also that 7—0. This second 


Define B N Be Gi L. М/@) а, [ N f(t) ar] >0. (19) 
s -z= | fw dx dt. 


J «(t - В) =J (t). (20) 


Then if 25 L, 
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- Ift<B, 


0<7,4-В)-/Л()<2” ji | S(t) dt/D < 21N 170) -J3(2]/NDD. (21) 
Similarly, if B «2 « ZL, ° 
0<J,((—B)—-J,() <2(L — DN'[7,(0) —J,(9]/ND. (22) 
Consider Q=[9(N)— Q.(N)] -[0.(N’)-0,(N")], 
where Q,(J) is the probability of seeing flash Fi. Considering variations in the 
moment boundary: 


0- |040, 2-049, 9) -to., t- B) - Qo] yi. 


Since J(t) -J(D +t), inserting z— B instead of z rearranges the contents of the 
integral without affecting its sum. Using the mean value theorem since it 
may reasonably be assumed that Q is a continuous function of J 3 š 


o- J, poem), аел (0) Да 
where J*(t), J**(r) lie within the range: 
Js) <J*() <J,(); 7/0 <J**(t) xJ(t—B) for all ғ, 


Therefore 
D dQ dQ 
9» f о-о (9) -ae-m- (&), а 
40 — s fdO Ең 
where (2)... = min (2) for N2xJ«N; 
40 д а0 , , 
(m. = max (Z) for N'|2 <J < N'. 
40 Оу - 
74 ЖСЖ 
Since Q is a monotonically increasing function of J, «>0, (% >0. Тһеп 
Jaw 
d L 
9» (29) шр) [tto 7m - 4-2) 1) 
J=J' 0 


since if ¿> L, J,(t)=J,(t) and Js(t— B) -J (0) from eqns. (19) and (20). Тһеге- 
fore using eqns. (21) and (22), 
a2tN” 4 
ND | i 


0> (S)... up) f : (ШЕЛ П- 
+ [u= Е = |} P 


This last expression is necessarily greater than zero if ND»2aBN' and 
ND»2«N'(L—B). Assuming that L and В are such that O(N) is at the 
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desired frequency of seeing, let log N' —log N+0-13 so that Q,(N)=Q.(N’) 
from Section 5. Since Q»0, Q,(N’)>Q,(N). 

Hence if N” is the intensity at which Q,(N^) =Q,(V) then 

log N” «log М” «log N+ 0-13 
so that the difference between the threshold energies for ¿=0 and t=D is 
sed by introducing a lag. и 

= рш holds if ND SBN and ND » 2aN'(L — B), i.e. if B < ND|2aN 
and L—B < ND[2aN', both of which depend on а. œ can be estimated from 
the empirical data, e.g. Barlow (1956), but because it depends on the slope of 
the frequency-of-seeing curve only a rough estimate can be made. For the 
60 per cent point « is quite close to unity. If one assumes that the frequency- 
of-seeing curves are Poisson sums, then a more accurate estimate of а can be 
made, but it depends upon the criterion z. Considering the 60 per cent 
frequency of seeing for 1—2, 4, 6, 10, 15, «=1-39, 1-21, 1:16, 1-11 and 1-10. 
Taking «=1-2 as a maximum (since z;—2 seemed unlikely from Section 8), we 
have that the proof holds if B «0:33 D and L— B «0:33 D. 

However, examination of the proof shows that it contains a series of 


inequalities each of which is normally quite pronounced so that the result 
probably holds for a much wider range of lags. 
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NON-PARAMETRIC TREND TESTS FOR LEARNING DATA 


By А. R. JONCKHEERE and Gorpon H. BOWER 
University College London Stanford University 


Many current stochastic models of learning imply stationarity of portions of 
individual response sequences. Non-parametric tests are presented for detecting 
overall group trend and homogeneity of individual trends in a sample of protocols. 
The protocols may differ in length and consist of any responses that take ordered 
values, such as errors and successes, ratings, or response latencies. ‘The power of 
the test for overall group trend is discussed in relation to both linear and logistic 
alternatives. 


1. INTRODUCTION 


Recent developments in mathematical learning theory have revealed the 
need for statistical tests of trends in response sequences. Such tests are of 
importance when applying the one-stage or two-stage all-or-none learning models 
to empirical data (Bower, 1961; Bower and Theios, 1964) since the strongest 
parameter-free prediction of these models is that there will be no trend over 
certain portions of the response sequences obtained from individual subjects. 
Examples of this might be the sequence of trials prior to the last error, or the 
trials lying between the first success and the last error; and these predictions of 
stationarity hold whether success-error sequences, or response latencies are 
considered. 

We shall propose a statistical test for the detection of such trends which is 
powerful, and applicable to all cases known to us. In the following, though we 
shall focus mainly on short error-success sequences obtained from learning 
experiments, the results are applicable to sequences of any length consisting of 
responses that can take ordered values, e.g., latencies, response amplitudes, or 
confidence judgements. 

In learning experiments, the response sequences which are examined for 
trend have the following peculiarities. First, the performance measure often 
has only nominal significance, as with success-error sequences, and has to be 
correlated with a second variable, the trial number, which has only ordinal 
significance. Second, the composition and length of each protocol are usually 
random variables, different subjects having differing numbers of successes and 
errors, and different trial numbers for their last errors. Hence classical regres- 


sion analyses (e.g. orthogonal polynomials) are inapplicable, even if their 


restrictive assumptions could be met. 
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2. THE VINCENTIZED CHI-SQUARE AND ITS PROBLEMS 


Of several tests for stationarity of success-error sequences examined by 
Suppes and Ginsberg (1963), the Vincentized chi-square was recommended as 
being the least biased by subject and/or trial-selection artifacts and is now in wide 
use. In this method, each response sequence is divided into B equal, non- 
overlapping, trial blocks, and the number of successes and errors tabulated in 
each block. The 2x В contingency table obtained from each protocol is then 
summed over all protocols, and the null hypothesis, that the proportion of 
successes is the same in each Vincentized trial block, is then tested by the chi- 
square statistic, with B — 1 degrees of freedom, computed from this overall table. 


We shall discuss the deficiencies of this test before proposing an alternative 
method. They concern both the Vincentizing procedure and the chi-square 
test. "The procedural problems of the Vincentizing method have been discussed 
elsewhere (Hilgard, 1938); here it will suffice to note that no rational grounds can 
be provided for deciding what B should be, nor for deciding what to do with 
any protocol whose trial-length is not divisibleby B. Current practices are both 
arbitrary and often involve discarding data. Further, the rationale of the 


Vincent method seems doubtful in the light of current models for learning. 


For it is assumed that all subjects have the same deterministic learning curve, 


differing only in a rate parameter, and that their curves could be superimposed 
by uniformly stretching or contracting the trials-scale for each subject. How- 


ever, in view of the descriptive success of recent learning models, using either 


subject-controlled operators or Markov Chains, this rationale is extremely 
questionable. 


"There are also several deficiencies in the use of the chi- 
test for trend. First, it is applicable onl 


tailored tests, 


The basic statistic 


the above criticisms, s Š (Kendall, 1355), answers al 


werful for detecting trends; it is 


c 4 measure has nominal, ordinal, ОГ 
ene ; the homogeneity of indivi jects’ 
as well as the group average trend: е Я 
3 ie and t the 
Vincentizing method are атыды D he questionable assumptions of 
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3. THE Test PROPOSED 


Notation We suppose that a subject’s response on each trial can be assigned 
to one of c>2 categories which can be strictly ranked in some a priori way by 
the experimenter. Examples of ranked categories would be success-error (c=2), 
Success-error in combination with one of five confidence ratings (c—10), or 
response latencies (e.g., c= 150 if that is the number of distinct latencies a subject 
produces in a sequence of trials). The most frequent, and intuitively natural, 
ranking of the c response categories is in terms of their relative accuracy of 
performance: correct responses are better than errors, correct with high confi- 


.dence are better than correct with low confidence, and fast responses are better 


than slow ones. 

Considering the ith protocol in a sample of N, let n; be the number of trials, 
and це the overall frequency of response category k, where i= 175%, Мапа 
R—1,...c. Later, when we refer to a general protocol and wish to suppress 
the protocol number in that context, we use z for the number of trials and л for 
the number of occurrences of category k. Тһе background theory indicates 
which trials are to be inspected for trend or the lack of it. We shall suppose 
that they have been extracted from each protocol, with the earliest trial labelled 
as trial 1. 

The null hypothesis is then that all distinct permutations of the 1 responses 
are equally likely to occur. The general alternative trend hypothesis is that the 
lower-numbered categories of response become more probable as trials increase. 


Calculation of S The ith protocol can be rewritten as an ordered cx m con- 
tingency table with response categories as rows and trial numbers as columns. 
Each column will contain one entry while the total number of entries in row & 
will be Nik. 

The computation of Kendall’s S (see e.g., Hays, 1963, pp. 649-650) for 
any such ordered contingency table can be carried out by the following algorithm. 
For each cell of the contingency table in turn, take the total of the frequencies in 
the cells lying ‘ south-east ’ (below, right) of it, subtract the total of the frequencies 
in the cells lying ‘ south-west ’ (below, left) of it, and multiply the result by the 
frequency in the cell considered. Summing the results over all cells yields S. 
Two numerical examples are given below: the first is typical of individual 
learning error-success sequences; the second has three possible categories of 


response. 
ExAMPLE 1 
Trials 
1 2 3 4 5 
Error 1 1 
Success 1 1 1 


for which $= 1(3—0)+0(2—0) +0(1—1)+1(1—2) =2. 
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EXAMPLE 2 dus 
1 2 3 4 5 
Response 3 0 1 0 0 0 
Categories 2 1 0 1 0 0 
1 0 0 0 1 1 


for which 8-(3-1)--1(2-0)-- (2-0) =6, the values for the zero cell entries 
having been omitted. The directional ordering of the response (row) categories 
is immaterial, so long as one is consistent: one order gives + S, the opposite 
gives the complement or —S. We have adopted the convention that an 
increasing proportion of ‘ successes’ over trials is reflected by positive S scores. 

Equivalent forms of the algorithm can be used directly on the sequences 
instead of the c xm matrices. One equivalent form is: assign to each trial in the 
sequence a score consisting of the frequency of higher-numbered categories 
which follow it, minus the frequency of lower-numbered categories which follow 
it, and add these difference scores over trials. Thus, the sequence 21312231 
yields an S оҒ(2-3)--(4-0)--(0-4)-(3-0)--(1-1)--(1-1)--(0-1), which 
equals 1. . 

For binary (success-error) sequences with z, successes and Ng errors, if we 
define и to be the number of successes which follow errors (summed over all 
errors), then 

S=2u—nyng. 

For example, the sequence ESEESSSE has u=4+3+43+0=10 and 8-2(10)- 
44-4. Тһе factor min, represents the maximum possible value of w. If 
instead one counts errors which follow successes, to get different numbers 
ш and S' —2u' — тул, then S' is the negative of S. For our illustrative sequence, 
u'=6 and S’=—4. Since 8” is the negative of S, it follows that u +w = nto. 
The calculational procedure decided upon must be used with all protocols. 

S is not defined for sequences having occurrences of only one response 


category (e.g., all successes). Such protocols obviously cannot be used for a 
test based upon the possible р 


ermutations of the response categories. Іп the 
following, when N denotes the number of protocols, it is to be understood that 
these are protocols having туту > 0 for some categories z and j. 
Null distribution of S On the null hypothesis, given the marginal frequencies 
Nk, all trial-wise permutations of these response categories are equally likely. 
In this case, Kendall (1955) showed that Si, the value of S for the ‘th protocol, 
approaches (with increasing mr) to a normal distribution, with mean zero and 
variance given by , 


1 с 8 
of= gm- Y nat) (аа Y nu), (1) 
k=1 6 i-i 
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where 744 is the frequency of category k for sequence i, and 7; is the total number 
of trials in sequence 7. For a dichotomy (c=2), as in success-error sequences, 
eqn. (1) reduces to З 
оё = (nignig)(ni + 1)/3. (2) 
From the foregoing it follows that х; = 5:/0: approaches to a unit normal deviate. 
Hence x; would provide a test statistic for the null hypothesis for the th sequence. 
Of course, for small 7; one would be concerned with the approximation of x; to 
normality. We consider this matter later, since it is not often that one is 
interested in a significance test for single protocols. 

However, before developing our group test, it may be of interest to point out 
the relation of Kendall’s S оп a dichotomy (for a single protocol) to the Мапп- 
Whitney U test and the Wilcoxon test (cf. Siegel, 1956). Тһе Mann-Whitney 
U statistic is equal to our и (or u”, whichever is smaller), and hence U is a simple 
transform of S. The Wilcoxon T statistic is a sum of ranks (or trial numbers of 
successes, say) which is related to S according to 


Т=5 2 (n+). (3) 


While the Mann-Whitney and Wilcoxon tests are usually viewed as tests for 
whether two groups differ, for our dichotomous case the two ‘ groups’ being 
compared are the successes and errors in a protocol, which are ranked by their 
trial numbers of occurrence. If tests are desired for single short sequences, 
where the normality approximation of S is in doubt, then the Mann-Whitney 
or Wilcoxon statistics should be used (instead of 5) since their exact probability 
levels are already extensively tabled for small z, and z, values (cf. Siegel, 1956). 
The decision, of course, would be equivalent to that obtained from calculating 
exact significance values of S, since U and T are merely linear functions of S. 


Test for group trend Let N denote the total number of sequences for each of 
which x;=Sjo;- has been calculated. Оп H, each x; approximates to a unit 
normal deviate. Hence, on Н,, the statistic 
N 
z= х «ГУМ (4) 
des 
also approximates to a unit normal deviate. We propose 2 as the test statistic 
for the null hypothesis that the N protocols, considered as a group, show no 
overall trend in success probabilities. One can use either a one- or two-tailed 
significance test, depending on whether a directional trend has been forecast, 
since the test is equally sensitive to either an upward or downward trend in the 
observed response probabilities. | f 
Table 1 gives a numerical example of the calculation of z for ten hypothetical, 
success-error sequences which reveal a small but consistent positive trend. 
The overall z is 2:38 which allows rejection of H, at the P=0-017 level (two- 
tail). For comparison, the sequences were Vincentized into quartiles, with any 
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odd trials being discarded from the middle of the sequence, and the Suppes- 


Ginsberg chi-square computed. This was 6-19 (3 df), giving P> 0:10 on Н,. 
In this instance, 2 detects а small trend whereas the chi-square fails to do so. 


TABLE 1. THe CALCULATION OF z ron 10 Success—Error SEQUENCES 


Protocol i nj Nai Si бі oi^ *i 

1 2 3 4 3:46 0:289 1:155 

2 1 5 1 3:16 0:316 0:316 

3 4 2 4 3-74 0-267 1:070 

4 d 5 3 12:30 0-081 0:244 

"s 5 2 4 2 431 0:232 0:464 
6 3 3 1 4:58 0:218 0:218 

7 3 4 4 5-65 0-177 0:708 

8 2 2 4 2:58 0-388 1:550 

9 5 3 9 6-70 0-149 1:343 

10 1 3 1 2:24 0:446 0:446 


10 
Exj—7:514, E gj71 =2:563, N=10. 
isl і=1 


M a 
2-15 „7514 _ 2:38 
VN ую 


4. APPLICATIONS 
Before entering upon a discussion of technical details, we wish to illustrate 
osed z test for trend. The main restriction on 
ies can be ordered a Priori, or that the responses 


i inate (1.е., each rearrangement of the rows of а 
схт Contingency table gives a different 5 value). 


daa ngle саб Я і е 
Іеагпіпр іп Paired-associates) eo A my ao im 


э ommissions, extralist intrusi 
д » 
» ог alternatively, the number of correct letters in à 


-learned responses, a theory may 


random guesses), 
cesses and examinin 
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analyses as before apply for any sub-sequence of the original sequence, since 
only the order of trials is important, though the analyses will not necessarily be 
independent. 

Another example is confidence ratings (e.g., regarding the correctness of the 
response given). These can be treated in several ways: (a) the confidence 
ratings alone could be examined without reference to the correctness of the 
response given, (5) the sub-sequences of confidence ratings for error and correct 
response trials could be extracted and analysed separately, or (c) the confidence 
ratings could be ‘ unfolded ’ in conjunction with the correctness of the response. 
For example, if the subject is allowed 5 confidence categories (e.g., 1 =lowest, 
5—highest confidence), then a correct response with confidence j is assigned 
score +j, whereas an error with that confidence is scored as —j. The scale 
from +5 to —5 is then ordered in something like ‘ relative strength of the 
correct response ' and the trend test сап be applied. 

Another example is a ‘ second-guess ’ experiment (e.g. Binford and Gettys, 
1965) where the three scores could be 1 if the first guess is correct, 2 if a wrong 
first guess is followed by a correct second guess, and 3 if both guesses are wrong. 
A further illustration is from an experiment by Bower (1967), where on each 
trial, to each paired-associate stimulus, the subject ranked all с available responses 
in terms of their likelihood of being the correct response to the stimulus. The 
score assigned for the trend test would be the ordinal position of the correct 
response in the trial-by-trial rankings given to a stimulus. 

Metric data collected during learning provide further material for the trend 
test. This includes response latencies and response amplitudes, e.g., GSR, 
eyeblink, incidence of alpha rhythm in the EEG, etc. When these are recorded 
collateral to a primary response coded as success-error, or as CR-nonCR, then 
separate analyses of sub-sequences can be made, e.g., trends in latencies over 
error trials and over success trials, or trends in latencies on trials following error 
trials and following success trials. For such metric or continuous response 
measures, the number of categories is in practice simply those distinct values 
of the measure which occur in a particular protocol; in cases where all the scores 
are different, then all z; = 1 and eqn. (1) for o;? can be simplified. S is calculated 
as before, as the sum over trials of the frequency of succeeding scores which 
exceed the given trial-score minus the frequency of succeeding scores which it 
exceeds. 

Since S and z are relatively fast and easy to calculate in all the above cases, 
these statistics are recommended for their convenience, range of applicability, 
and power as tests for trend in such data. We now turn to some more detailed 


statistical considerations regarding the 2 test. 


5. STATISTICAL PROPERTIES OF THE TEST 


The following sections discuss the convergence of z to normality, a con- 
wer of z in detecting different types of trend, and its 


tinuity correction, the po y 1 
Specific calculations below 


power efficiency with respect to the chi-square test. 
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when the test is applied to multiple response categories. Moreover, though our 
investigations of convergence and power are carried out only for ‘ short sequen- 
ces of three to six trials, the results also apply to longer sequences, since the 
approximation of S to normality increases with sequence length. 


Approximation of z to normality. Though the proposed test utilizes the approxi- 
mation of z to normality, this can be avoided, in principle, by explicitly convolu- 
ting the М individual exact distributions of 5; (or «;) and thus obtaining the 
exact distribution of z. However, the labour involved makes this exact pro- 
cedure impractical and is an argument for the convenient normal approximation. 

The goodness of this approximation of z to the unit normal distribution 
depends in general both on the lengths of the Protocols (the лг), and on the 
number of protocols (N), which are pooled. Тһе speed of convergence to 
normality may be investigated by comparing the first few cumulants of the 
distribution of z to those of the unit normal. 

For convenience in the following, let *ri denote the rth cumulant of xp 
yr denote the 7th cumulant of z, and mu =i. On the null hypothesis the 
cumulants of x; for a dichotomy are as follows (cf. the formulae for the general 


case of 5 in Jonckheere, 1954 а, Б); all the odd-numbered cumulants are zero, 
since the distribution of S is symmetric, while 


Kgi— 1, 
Ж 26 mio 1 
à m m]: 
48 24n 2ni?-- 2n, —1 
and gga 4 1 J] [Qnetzu-1) (45) 
n Ket 7+1) T Тт +1) ыы, wass (5) 


(m + 1) ` 
Hence, the odd-numbered cumulants of z are zero, while 


and : 

Ye= 2 Kei. 
eeu d unit Дош, а has the same first three moments, y4 CON- 
at the rate of N- > Ys 18 Zero, while Ув Converges to zero at the rate 
the approximation d b Max у= — L-5/N and max y.—9775[N*. Thus, 
eben M 2 to the unit normal distribution is quite close, and con- 

Pidly as N, the number of Protocols, is increased. 


атда Si is a discrete tandom variable with successive values 
fori=0,1,... 49° iea: TE meng, is even, S han the possible values +24 
rivera т/2. If mis odd, S has values £(1+21 fori-0,1,..., (m—1)/2. 


| 
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Therefore, when the zz; are fixed, the sum of N of the S; has possible values 
which jump by two’s. When the exact distribution of У S; is approximated by 
the continuous normal, the discreteness of > Si slightly affects the tail of the 
distribution which is important for assessing the level of significance of particular 
results. In the simplest case, when all zz; and 74 (and hence о;) аге equal, z is a 
constant times >; Si, and a correction for continuity is effected in this case by 
reducing the numerical value of > S; Бу 1 before computing z. When the 7 's 
(and hence o;'s) are different for the various protocols, an appropriate correction 
for continuity would seem to be 


1 Si 1 
= dl x= -gXab (6) 
If all о; were equal to a constant о, then eqn. (6) reduces to 
, I2 5-1 
ee N C 


Of course, as N becomes large the correction factor becomes negligible and can 


be disregarded. 
Figure 1 illustrates the value of the continuity correction when Ñ is small 


(4 in this instance). Тһе distribution of S for two successes and two errors 
was convoluted four times to obtain the exact probability distribution of > S; 


ac N=4 
° Exact Prob. 
A&A Uncorrected z 
«БЫ о—о Corrected 2 
44 
€ 
N 
ác 
м 
L 
Es 2 
1 
44 
0 


= 
16 14 12 10 8 6 2 0 


Ficure 1. Approximation of corrected and uncorrected z to exact tail probabilities. 
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(ог 2, since oj is constant here). The tail probabilities, P(z > К), for the un- 
corrected and corrected z were compared to those of the normal deviate. 
Figure 1 shows that the exact tail probabilities are well approximated by the unit 
normal function when its argument is the corrected z, but not when its argument 
is the uncorrected 2. Thus, when N is small, the continuity correction materially 
aids the normal approximation to the exact probabilities 


6. POWER or THE 2-Тевт 


. Support for the null-hypothesis of stationarity can always be obtained by 
using a small enough sample of short Sequences. Only by assessing the power 
of the statistical tests employed can we avoid this facile acceptance of the null- 


hypothesis, and hence spurious support for the learning model under considera- 
tion. 


For our case, H, is the nul 
marginal totals 7; and т, Н, 
and 74; errors are equally li 


l-hypothesis of no trend; in particular given the 
states that all distinct permutations of Ni, Successes 
kely. Тһе general alternative is that there is, say, 
an upward trend; in particular, that Sequences with high S-values (with successes 
clustered at the end) have relatively high probabilities. 

_ As often happens with non-parametric tests, the basic problem in calculating 
their power is the difficulty of deriving the distribution of the test statistic 


(z here) given only general specification of the non-n i 
-null hypoth the 
equally likely restriction is removed. When equal Mas of di did 


š Such a scheme will always 
nic trend observed in practice. 
hypothesis, Ay, assigns a probability 


lternative hypothesis, Hg, considered 


a се increases linearly with its S value, 
Pr (sequence with value 8)- D ) É + 5 5 a 
The trend parameter is 4; к і 


—— — 
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Consider a simple example, namely, one success and two errors. There are 
three sequences possible, corresponding to the success occurring on trial 1, 2 or 
3. The S values assigned to these three sequences are —2, 0, +2, respectively, 
and their probabilities on Hg are (1/3)—d, (1/3), and (1/3)+d respectively 
(where 4<1/3). In this simple case, the trend can be read off directly from 
the sequence probabilities; in general the proportion of successes on trial 7 is 
obtained by summing the probabilities of those sequences having a success 
on trial 2. 

Several simple cases are summarized in Table 2. The left-hand column, 
labelled z,/n;, gives the number of successes and errors. The next columns 
give the trend in proportions of successes over trials, while the last column gives 


the constraint on d implied by eqn. (7). 


‘TREND FOR VARIOUS CASES ОЕ л; SUCCESSES AND 71g ERRORS 


‘TABLE 2. 
TRIALS 

nj [ng 1 2 3 4 5 6 а<х 
1/1 0:5-0-54 0:5 +0:54 1:000 
1/2 . 033-d 033 0:334-d 0-333 
2/2 05-34 05-4 O5+d 0:5+34 0:083 
2/3 04-64 04—34 04 0-4--3d 04-6а 0:033 

05-34 05-34 05-94 05-154 0:011 


3/3 05-154 05—94 


Тһе cumulants of the distribution of S under На may be obtained as follows. 


From eqn. (7) we have 
Pr {Si=S|Ha}=Pr (Si- S|H,) (1+ =S), 
where a= ( a ) -A Hence, if Ga(x) and G(x) are the probability generating 
1 
functions of S оп Ha and Н, respectively, 
Ga(x) = G(x) + axGo (х). 
Putting х= еі, we obtain a corresponding equation relating the characteristic 
functions of S on Ha and Н; 
Palt) = $o(t) + «фе (0). 
Differentiating and putting 2-0, we can now express the raw moments, (а), 
of S on Ha in terms of the raw moments ш (0), of Son Ну. Thus, 
pr’ (d) = ш (0) + аш" (0) (r=0, 1,...). 
In addition, since the distribution of S on H, is symmetrical we have 
i s (0) -0. 
Hence, if Ki(d) and Ki(0) are the ith cumulants of the distributions of S on Ha 


and H, respectively, we have 
Ky(4)- оК.(0), 
K,(d)=K,(0) (1-г7К.(0)), 
K,(d) = « (K4(0) - 20? K ;*(0) }, 
Куй) = K (0) ~20°K (0) (2K (0) + 30°K,%(0)}. 
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Assuming all N protocols to have z successes and тз errors, then the rth cumu- 
lant of z on Hg will be 
Ar= MC) c —rI(0) кДа), 
where ka (0) =(mn,|3) (п +1). 
And in particular, we have for the mean and variance: 
Ah e (Nis (0)) — Ad /(N), 
Аа= V {1 - o*«(0)) = /(1— 4°), 


wee e) fi 


The approximation to the cumulants of a normal di 
adequate, even for the simple example given above where n,=1 and п,-2. 
Putting d=0-10, giving a three-trial trend of probabilities 0-23, 0-33, 0-43, then 


A, =0-245./N, = 0-94, A3= -0:338/ү/М, апа à= 277 1-16/N. Even for im 8 
the third and fourth cumulants 


normal approximation for its distribution. 
Hence the power of the z-test against the alternative Не is given by 
Pr{z> C,|Ha}=G(U), 
C.-A,_C,—-Adv (N) 
h = a 1 = а 
bs Uh C Vd 
C, is the unit-normal deviate giving an a-sized critical region (e.g., Coos 1:645 
for a one-tailed test), and G(U) is the probability that the unit normal deviate 
is greater than or equal to U. 


It can be seen that the power of the test increases wi 
the length, л, of the pro 


stribution appears to be 


‹ ith the number, N, and 
tocols, and also with thesizeofthe linear trend parameter, d. 


imply logistic learning curves, 
ive to the null hypothesis. Statis- 
сеп investigated by Cox (195 8), 
and their relevant results may be briefly 
Presented. 
Шен equal 1 or 0 according to whether a Subject has a success or an error on 
trial 7, The logisti 


ic trend for the inc, i ili is given by 
the following expression: “ease in probability of success is gv 
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: The likelihood of obtaining a particular sequence of л responses t, . . ., tn is 
given by 
Ang 
ae. 
(14-40) 
-1 


Lyn = 


n n 
where ту = > 2, the total number of successes, and T= 228, the summed 
ісі i=1 


ranks (trial numbers) of the successes. Since T= 3 t E (n+1), where S is 


Kendall’s measure used above, we now have 
Ат. Qni (91)/2 95/2 
=>. 
II (14-460) 
і-і 
Let Ng, n, be the number of permutations of 7, successes and n—n errors which 
give the same value of S. "Тһе probability of obtaining a particular value of 


S with л, successes out of z trials is 
Pr(S, n, n) - L,Ns, s, (8) 
Hence the probability of z, successes occurring is given by 
Pr {ny}= E LNs, np 


where the summation is over all possible values of S. Thus the probability of 
obtaining a value of 5 conditional upon m, successes occurring 1n 7 trials is, after 


Ln 


reduction, 

М. „05? 

Pr ( S|ny, n, O= улту EA (9) 
С А 


Using the known generating function for Ns, „„ it can be shown that 


X Ns 6920-чы | (LO 
У Ns, 1-9 J' 


i=l 


and hence that 
m y 1-8 
Pr{S lm, п, 0)= Ме, „8 ne ( E жа) (10) 


i=1 


When 0-1, we have the probability distribution of S under Н,, viz., 


Pr{Slty 2,01) - Ns, s, | » (11) 


The probability distribution of S under H, is easily obtained from eqn. (11): 
one simply multiplies the probability of S under H, by 05/2, and divides by a 
factor that makes the probabilities sum to one. In comparison with the previous 
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scheme, whereas the linear trend added dS|2 to each probability of S arising under 


Hy, the logistic scheme multiplies each probability by amounts proportional to ` 


05/2. 


We can also easily obtain the cumulant generating function of the distribu- 
tion of S under Н, as follows. Let G(x; 0)= > Pr{Shn, n, 0]1x* be the proba- 
8 


bility generating function of S under H, From eqns. (9) and (11) we have 
а ) Pr(S|m, n, 0=1} өз 
Pr(S|n, n, 0}= = А 
X Рг], 7,0-1 josie 
s Mh 
‚ p 6(6/2;;0--1) (12) 
and thus G(x; 0)— CORTS а 
Putting x —et, 0 еза and taking logarithms we obtain 
Ve £) +106 Ges; 0— 1) еа; 0=1), 05) 


an expression relating the cumulant generating functions A(t; 6) and (1--а;0-1), 
of the distribution of S under H, a 


nd Hy. Thus, if (0) and (1) stand for 
the ith cumulants of $ under H, and Н,, we have 


© i © 1 
Zz = E Ouray, 


Equating the values 


of t, the cumulants of S under H 
of those under Ah. 


ө may be expressed in terms 
In general we have 


= S e), 


rai 


0 а e 
Kati (0) = «ку o(1) + 3j 2401) + gj atl)... 


a? 4 
yr T nsa) ita 


where 7=0, 1,... and a= log 6/2. 


Thus In so far as the distribution of s 9n the null hypothesis, Hy, approache? 
normality quite rapidly, the distribution of ç on the alternative hypothesis 429 
also rapidly converges to а normal distribution for small values of 0. Hence 
the distribution of z on H PProximated by a normal distribution» 


a Will be well а : 
even for small values of ? (the lengths of the individual protocols), provide 
(the number of Protocols) is reasonably large, 


ы) 
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We may also easily obtain explicit expressions for the mean and variance of 
S under H,, since the probability generating function of S under A, is 


т [mio — yo (mt) т 
вз =)= П | |6. 


Hence putting x= et and 0= e=, taking logarithms, and substituting іп eqn. (12) 
we obtain an explicit form for the cumulant generating function (2; 6). Differ- 
entiating and putting #=0, we have in particular 


x (0) = > Ге, +7) coth {a(m,+2)}—z coth («i | ; 
ісі 


m P (nt 
к{#= Х args ~ аав 9)" 
where «--Іор 0/2; 
fis бағы 4-1 ./6--1 
ог x0) = à [+a (енің) -2 (53) | А 
f ; \2 ndi y? 
айа «(0-4Х oi (ІС) —6n, (ғас) | . (14) 


To illustrate the adequacy of a normal approximation to the distribution of 
S for a special case of H,, consider sequences with two successes and two pen 
The non-null distribution of S, obtained from eqn. (12), is shown in Table 3. 


TABLE 3. DISTRIBUTION ов S ron Two Successes AND Two ERRORS UNDER Ho [FROM 


EQN. (13)] 
S Ns 6939/2 — Pr(S) 
4 1 e 46% 
2 1 өз Ag 
0 2 6 240: 
-2 1 0 40 
-4 1 1 A 


where А — (1 — 0) (1—03) /(1— 03) (1-0). 


The mean and variance are given by eqns. (14), with z»,22,—2. If 0—2, for 


example, 


72 5736 
к(2)- 35 =2:06, and «,(2)= 129 -4-68. 


The distribution of S for 0-2 was convoluted N=4 and N=8 times to 
determine the exact probabilities of the distribution of z. For fixed т and ns, 
х approaches normality if XS, does. Hence, the exact probability that 25% 
exceeded K was compared to that calculated (with continuity correction) on the 
assumption that XS; was normally distributed with mean Хк,(2) and variance 


M S.P. 
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Nx«(2). "The comparisons of tail probabilities for 0—2 and for N=4 and 8 are 
graphed in Fig. 2. The normal approximation is already quite close for N —4 ` 


(maximal discrepancy of 0-025 at K — 10) and is much improved for N=8. Тһе. 


0 —2 case is useful to examine in this r 
of 0 values, this one induces the 
normal approximation for smaller 
fore be expected to be even better 


espect since, within the reasonable range 
maximal skewing of the S distribution. The 
[9 — 1| values, or for larger 7, or N, may there- 
than in Figure 2. 


о 


Nea 10 


9 а М-8 
Е e—e Exact Prob. 9 Ñ — e—eExact Prob. 
8 ° °Normal Approx. 8 9 ©Normal Approx. 
T 7 
gs z6 
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FIGURE 2, Adequacy of z approximation for Special cases: (a) 0=2, N= 4; (b) 0=2, N=8- 
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t against this type of trend will be G( ШЕНІ 


C, is the unit normal deviate giving a 


һе unit normal devia 
Noted that for both 
; the probability оғ Suc 


П a-sized critical region, and G(U) is the 


te is greater than or equal to U. - 
the linear and the logistic trend schem | 
cess at any trial is considered independen 


Saw тн 
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of the outcome on previous trials; and hence these alternatives do not represent 
cases where probability changes are contingent upon the prior sequence of 
responses made by the subject. 


Relative efficiency of z and chi-square In this section we attempt an approximate 
assessment of how much more efficient is the z-test for trend in comparison to 
the chi-square tests in current use. The relative power efficiency of the 2 and 
chi-square tests may be roughly estimated as follows. Selecting values of 715, п, 
and the trend parameter (4 or 6), let Nz and Ng be the number of protocols 
required to have power P when using the 2 and chi-square tests for trend respec- 
tively. Then the ratio Nz/No estimates the efficiency of 2 relative to the chi- 
square for this specific case. This ratio generally can be expected to be a 
function of the m, the trend parameter, and the power-level of the comparison. 
In the following, we shall use the 50 per cent power level for comparisons at a 
significance level of а--0-05. Calculations of power for the chi-square statistic 
are complicated by the fact that under the non-null hypothesis the usual statistic 
is distributed as a zon-central chi-square, with a non-centrality parameter 
depending upon N and the trend parameter. Strict general solutions are 
difficult to obtain and we shall employ the following approximate methods. 
Suppose a chi-square with k—1 df has ап о =0:05 significance value of R. Ву 
setting observed frequencies in the contingency table equal to those theoretically 
expected on the trend hypothesis and then calculating the chi-square, we obtain 
a rough estimate (which improves as N becomes larger) of the mean of the 
non-central chi-square. This estimated mean will be a function of the number 
of protocols, N, and the trend parameter, dor 6. Because of the skew of chi- 
square distributions (i.e., the median is less than the mean), if we set this esti- 
mated mean equal to R and solve for №, then the power for this value of N is 
somewhat less than 0:50 (i.e., the probability area above R is less than 0-50). 
Thus, by putting this value of V equal to Ng we shall be giving the chi-square 
a slight advantage as compared to the z test when estimating the 50 per cent 
power efficiency. 

A final word before beginning the comparison. The trend schemes 
illustrated assign probabilities to entire sequences of successes and errors, and 
they do not re-assign that probability independently to each response in the 
Hence, with sequences & trials long, one has N protocols each with 
ervations, not Nk independent observations. "Thus, Cochran's 
the chi-square for А related observations is the appropriate test 
schemes (cf. Cochran, 1950; Siegel, 1956). In the Suppes 
paper, Cochran’s test was not used because the particular 
d assumed that successive responses were independent. 
e not to have decisions about trends dependent 
dependence of successive responses. Thus, 
ave wider applicability, as well as being 


sequence. 
В dependent obs 
modification of 
statistic for our trend 
and Ginsberg (1963) 
null model being teste 
However, it would seem preferabl 
upon assuming or demonstrating in 
Cochran’s test would seem generally to hi 
required by our specific comparisons. 
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Cochran’s Q statistic is defined for N subjects (sequences) each yielding k 
related observations (trials) as follows: 


k 
Kk 1) X. (Gj- 9): 
Qia- N = N ш 


where 14 is the number of succes 
of trials, G; is the total number 
of successes per trial. McN 
when k=2. On the null hy, 
trials, O is approximately dis 

For our calculations, v 
k=n, replace 1; Ьул, 
the mean success prob: 
estimate of the mean 
meter. Equating thi 


end for the 1/2 case from 

2 2, > P2=0'33, and p3=0-334d. With appropriate 
substitutions, Q in this Case reduces to 

2n(n — 1)Nq* 

= Itin Das =6Na°. 

n(n —~n,) 
Square value above 5-99 is si 
d, the N required 


With 2 df, a chi- 


gnificant at the 5 per cent level. 
Hence, for given 


to make O, > 5.99 is 
5-99 Ж 


Proceeding analo 


gously with the z-test 
with a two-tailed test 


» 


for this case we have, for о-0:05 


246dA/N > 1-96, 


e 

rent constant divided by 4. Hence, ve 

А No which gives the propor еден 

15 used to detect a trend. Т ve got 
T Cases ОЁ n, successes and n, errors, This ra 

. ` B 2 Г 

із independent of 4, but this follows from our choice of the 50 per cent power 


ze : " me 
Me 271, z and Q are equivalent tests with the ied 
c ICIenCy of Q relative to 2 decreases as the pa ampi E 

4 w 

5 18 te » Since S then has a larger range over WA Б 
ю vary. The indices іп Tab slight Over-estimates of the true est 
he chi-square power is actually somew 
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less than 0-50. If in addition a directional trend is predicted, enabling a one- 
tailed test with z, then the relative efficiency of z improves even further. The 
ratios in Table 4 may then be reduced by multiplication by 0-705 =(Co-10/ Соо). 
The chi-square, of course, is inherently a two-tailed test. 


TABLE 4. EFFICIENCY oF Cocuran’s О RELATIVE ТО z IN TESTING FOR A LINEAR TREND 
WITH 71, SUCCESSES AND zs ERRORS 


[па NIN 


1/1 1:000 
1/2 0-634 
2/2 0-490 
2/3 0-405 
3/3 0:347 


Тавгв 5. Numper or 2/2 Cases REQUIRED BY 2 OR О Test To Have PnonanBiLITY 0:50 оғ 
DETECTING A LOGISTIC TREND WITH PARAMETER 0 
Average 
increase 
0 per trial N; No 
14 0-025 2540 4830 
12 0:045 70:4 144-7 
13 0:063 35-0 120 
14 0-081 214 443 
1:5 0:096 15:2 31:3 
1:6 0-110 11:5 247 


А somewhat different approach is illustrated for the logistic trend in Table 5. 
It shows the Nz and Ng required to have 50 per cent power when there are two 
successes and two errors with sequences generated by the logistic scheme with 
parameter 0 (cf. Table 3). These use an a=0-05, two-tail rejection region. 
The four trial-proportions, ру, have a mean of 0:50; the average trial-to-trial 
increment as a function of 0 is indicated in the second column of Table 5. Тһе 
efficiency of Q relative to z in this instance is approximately 0-50, independent of 
0. This is reminiscent of the results in Table 4 with the linear trend, where 
relative efficiency was independent of d. Interestingly, Table 4 also gave a 
relative efficiency near 0:50 (0-49 actually) for the 2/2 case and a linear trend. 
More cases will have to be examined, but on these preliminary grounds it 
looks as though relative efficiency of z to Q at 50 per cent power is primarily 
a function of protocol length and is unrelated to amount of trend (d or 0) and 


type of trend (linear or logistic). 


7. HOMOGENEITY OF SUBJECTS TRENDS 


It is perhaps obvious that without a test of inter-subject homogeneity, 
non-significance of any overall test of trend for a sample of protocols does not 
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š МӨР š itv of 
necessarily entail consistency with a learning model esca теңге d 
ivi z i П group trend and, 
individual sequences. "The z test is for overa : d 
Rage could fail to reach significance if different protocols show strong trends, 
but in opposite directions. | | = 
The following test for the homogeneity of trends is a useful in nie 
interpreting the outcome of the z test. Its only drawback is that t e s we 
employed converges rather slowly to a chi-square distribution, and thus i 


i ith the 
restricted to longer protocol Sequences than those which can be used with 
2 test. 


bout the observed mean à. This suggests 
Since under H, each x; врртоесій 
e in the limit as z;; and 745 become large, it follows ron 
n with one degree of freedom oe 
On independence assumptions, the sum s ° 
ocols will thus approach to a chi-square distri “A 
tion with N degree of freedom. Thus У xi? is not an appropriate test a 
however, since it can be large simply because t 


А м іпсе 
mean, #, is large. But we can subtract a correction factor for the mean. Sinci 
2 approaches the unit normal distribution, 22 


bution with one degree of freedom. It therefore follows that 


ow a ; а 
quare distribution with N—1 df as each mj an 
Statistic, which js ju 
mean, is proposed as an 


spa x test 
? It is independent of the 2 a 
mally distributed, and so any patte 

Statistics W and 


5. 


m of many non. the 
er Showing these convergence properties, we turn к н 
Pproximation of W (or a modification of W) to familiar distri 


to chi-square, Aft 
question of the a 
tions, 


о 
-Square variate with N—1 ean 
2181 cumulant qual to N— 1, its second to 2(N — 1), its t o 
to SN — 1), and its fourth to 48(N — 1). nowing the theoretical pom 
5ч, tedious algebra assisted by Fisher's R-statistics (Kendall, 1958, p. 281) enab 
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one to derive the cumulants for x;2 and hence for W. The first four cumulants 
of W will be denoted as т; to ту and prove to be: 


m=N-1, 
N-1\2 
an= ijy (%-) Siale 


N-1\? =Д\* 
тэ=%(М-1)+12(®т—) xus) Ew 


q,248(N —1)-- 144 (x rai +24 A) Укы 
а= 4 Я 
+ (% *) У kat < D (&N*-9N +6) (= 2 ° үй 


where к and кеі are as given in eqn. (5), and xg; is the eighth cumulant of xj, 
not presented here (cf. Jonckheere, 1954) due to its length. Examination of 
these expressions shows that these cumulants approach those of a chi-square only 
if the extra summation terms on the right add to zero. But it is unlikely that 
these extra terms effectively sum to zero, except when each ni and 74; becomes 
large. For example, consider тә: as the number of protocols, N, becomes large, 
the extra term is essentially Xj қа. But from eqn. (5) it is known that <a is 
always negative, converging to zero only as 1 and nig become large. Hence тз 
in practice will always be less than the second cumulant for a chi-square variate, 
and the more so, the shorter the sequences involved. Similar remarks apply to 
a, and z £ The cumulants of the individual protocols enter in such a manner 
that summing N of them does not eliminate disturbances introduced by a few 
short sequences. This leads to an obvious caution regarding use of W as а test 
statistic: И/ should not be used with short sequences (say, 741 OT Mia less than ten) 
since it is not then exactly distributed as a chi-square statistic. 

The best approximation to the exact distribution of W for small values of ní 


appears to be obtained by normalizing W, using a non-linear transformation 


derived from Edgeworth's Type A expansion of frequency functions (cf. Kendall, 


1958, pp. 163-166). 


If we put 
w= r0 and [з= тз[т2/°, 
L 
then ¿=w T (2-1) 


will be approximately а unit normal deviate for even small values of ni and tits. 
is approximation for selected values of N, 


Table 6 shows the accuracy of thi 
he exact values of Pr (Wz W,) were obtained by con- 


тө Mig, and W, where t! 
voluting the distribution of W for N=10 and 16. 
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"TABLE 6. Accuracy or NORMALIZED W COMPARED WITH Exact PROBABILITIES 
N та та Wo Pr{W>Wo} % Pr(£z &) 


10 3 1 1512 0:007 2-30 0-011 
14-48 0:024 2:06 0-020 
13:20 0:048 1:58 0:057 
10 2 2 16:86 0:011 2:24 0:013 
15:66 0:025 1:94 0:026 
14-64 0:049 1:67 0:048 
16 3 1 22-40 0-010 224 0-013 
21-60 0:025 2:00 0:023 
20:60 0:056 1:70 0:045 
16 2 2 25:05 0-010 2:32 0:010 
23:36 0:026 1:96 0-025 
21:86 0:051 1:64 0-051 


TABLE 7. VALUES оғ Kai AND ка [EQN. (5)] ron T, —73i nig S10 AND mi=ninis 


Bi mi Kai Kei 
2 1 —2:00 16:00 
3 2 -150 9 
4 3 —1:36 8:32 
4 —0:96 4:08 
5 4 —1:30 7-75 
6 —0:80 2-89 
6 5 -127 7-46 
8 -073 243 
9 —0-63 1:80 
7 6 -125 729 
10 —0-69 220 
12 -0:55 1:39 
8 7 -124 719 
12 — 0:67 207 
15 -051 1:19 
16 -047 100 
9 8 -123 742 
14 -065 ^ 198 

18 -048 1:07 

10 20 -042 0:82 
9 -122 7-07 
80-06 193 
a —0-46 1:00 

25 —0:39 0-71 
-037 0:64 
As n and ™ both beco; қ 
cumulant of the Standardized me large, L, tends to v {8/(N—1)}, the third 


chi-square distribution with М-1) degrees of 
freedom. То the extent that Ls is close to this value, it is pete т safe to 
ted as x with (N—1) degrees of freedom. If, 


kawi Sms o ————————— E 
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for example, ту — 13 — 10 for all i, and N= 10, then L,=0:82, V (8/(N-1))7 
0:94, and it is reasonably safe to assume that W is distributed as a chi-square 
variate with 9 degrees of freedom. 

To facilitate the computation of ку and кы, 
constants for all cases where m; <10. When m> 
to the value of «gi is given by 


Table 7 gives the values of these 
10, a satisfactory approximation 


100 , 
к= 751 Kai" + 


: Finally, Table 8 shows, as an example, the computation of 2, W, and their 
significance tests, for a sample of 10 protocols. It can be seen that z is only 
significant at the 12-7 per cent level, and hence that the null hypothesis of 
stationarity is upheld for the overall group result. Wis significant at the 1-4 per 
cent level, however, indicating significant heterogeneity of trends between the 
protocols. Hence, the evidence from this artificial example would not support 
a learning model predicting stationarity of individual protocols. 


Taste 8. EXAMPLE OF 2 AND W 'Trsts OF SIGNIFICANCE 


Protocol ті та Si ci Ni Kai Ko 
1 4 6 16 9:38 171 —0:39 071 
2 5 5 -11 9:57 —115 -037 0:64 
3 10 4 —10 1414 —0:71 —0:34 0:54 
4 2 6 8 6:00 1:33 —0-67 2:07 
5 7 6 22 14-00 1:57 -0:29 0:38 
6 4 8 -24 11:78 — 2:04 —0:36 0:60 
7 10 4 12 1414 0:85 —0:34 0:54 
8 8 3 18 9:80 1:84 -045 0:95 
9 9 4 -22 12:96 -1:70 -0:35 0:57 
10 7 4 20 10:58 1:89 -0:37 0:64 
N=10 3:59 —3:92 7:64 


У 
z= 2 xy VN=1-14; Pr (22:34) 20127. 
š 


N 
у= X хй-ай- 2243; 
ізі 


-1 


2 N 
my=2(N-1)+ A) a Kai 
N-1 
ЕТЕНЕ 2 


= т”-(М-1) -249; 
Vs 
L= түт! 20:691 ! 


t= 2 (аа) -220; 


Pr{€> &)- 0014. 


Wo 
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decision errors) be prespecified for eac 
that led to the invention of the R technique an 
rates for this technique are given here. Experiment- 


the probability of making one or more decision errors. 
part of the doctrine, the decision-based error-rate view leads naturally to a con- 


sideration of the probability distributions of ғ decision errors and such distribu- 


tions are given in this paper. 


1. INTRODUCTION 
ome attention to error-rates of the first 


kind in the analysis of data from statistical experiments, though this is not always 
done explicitly. If the only error considered is the type I error, statements 
about the hypothesis accepted, when the null hypothesis is rejected, must be 
somewhat vague. In such a one-error context the sample size must be chosen 
on the rather arbitrary grounds of time and materials available for the expert- 
ment. Although such factors are not unimportant, at least of equal importance 
is the size of a treatment effect we hope to detect. A concern with the size of a 
treatment effect we intend a statistical experiment to detect involves con- 
sideration of alternatives to null hypotheses and error-rates of the second kind. 

Given the choice of null hypotheses (ук), at least one alternative to each 
(ак), ап error-rate of the first kind (а) for the оку, an error-rate of the second 
kind (1—8) for the үкү and a test procedure for the о; the sample size (m) 
required to satisfy these conditions is determined. The specification of oii, кр 
a, 1 —f and test procedure not only makes explicit the intended amount of 


It is now common practice to give s 


- discrimination between hypotheses, it also compels proper emphasis on the 


design of statistical experiments. 
In this paper we shall consider a populations in which the variate Y is 


normally distributed with means py (j= 1,2,..., а) and common variance o. 
It will be assumed that each of these populations is randomly sampled an equal 


number of times (z). 


2. NuLL HYPOTHESES 


heses considered here are linear statistical hypotheses about 
al form of the ith such hypothesis is: 


oret Cala сана... + ciata di — 0. (1) 


The null hypot 
the py and the gener 


» Chosen by the investigator because. a 
their relevance to the Purpose of the investigation and d; is a real number whic 


, is i ast. 
may be zero. When, for fixed i, Усу=0, the hypothesis is termed a contr 


Any contrast in which one “4-1 another су= —1 and all other cy = 0, " 5 
difference between two means (with or without the added d;) and is terme 
comparison, Examples of (1) for a=4 аге 


о шз раш — 1000; 0K2—H —рә=0); 
oks =H; — pa — 0; oka =0:5(u +15) -05(и%ш)-0. 
The values of 4j, their sum of Squares апа 4, are 
4171, 1, 1, 1; Xoaf-4; d, — —100. 
j 
This hypothesis is not a contrast because Ley 0. 


The last three hypotheses are contrasts, oka and ық being comparisons. 


І experiments which use contrasts have all 
equirement that this be so. 


°Ач= (суру + азуу су? = 0. (2) 
3 2 

rs of ок; and oå; are 

= cM, +с,М,4. .. 


Тһе sample estimato, 


. +ааМа+4, (3) 

апа 

TSE yM + адуу; (4) 
4 2 

le means. If we mult 

c b this will Change the value of k; for given Mj, 

t Will be unchanged; 

unaffected by such a multiplication. 


ion about 044 and or is 
1 D It is Sometimes useful to consider 
normalized linear hypotheses, Such an is i 


h among hypotheses. ‘The que 

€ matrix T is is diagona 

non-zero elements being Xa? pCa “C,” and when this is diag 
3 


$^) the hypotheses ате mutually orthogonal. The 
index T indicates а transpose, 


Ж- 


Ww. 


EST а 
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9 


We may defi m р 
т пе pk, to have elements which are th 

eses with „Са of rank p. The total amount of ven ask w a s 

hypoth e Mj, 


fo i 
г the given di, accounted for by these hypotheses is then 


If the h кока (5) 
е hypotheses are mutually orthogonal, eqn. (5) simplifies to 
Lm=DLe 
т > i (6) 


If p= 
a and all 4-0, then eqn. (5) or (6) is the uncorrected sum of squares of 


the sample means, i.e. 
Ly, = USSB-2X Mf. 

УМ; (7) 
6) is the component for the 


If p= 
ф-а-І contrasts and all d; —0, then eqn. (5) or ( 
4 sum of squares between 


maximi Eh 
на аон contrast, which is the same as the correcte 
ample means, as in analysis of variance, i.e. 
(8) 


Lg -S8B-nY(M; -М) 
j 


where M. is the general mean. 

ай, | clique = analysis of variance procedures test Алп 70, for which 

statistical t h he estimator. Although this component plays an important part in 

sse s eory, it has been argued elsewhere (Rodger, 1967) that it is rarely of 
y interest to empirical research workers. The component Àm will be 


Used in, thi 
sed Re this paper mainly for theoretical purposes. 
‘elations (7) and (8) may be made to hold when all d,#0 by defining 
nd they are adjusted by the 


j= Y;m,. The Ў; are the observed means à 
lements of ,ma given by 


т) to allow for the dj. ‘The mj аге the e 
and ma 71d, (рСа асату ‘pCa 9) 
id,” has elements dr. 
The decision rule that will be used in this paper is to reject those o for 
(10) 


Which 

p= lape 2 Fa-2^ 
criterion. In 
om, and Fa-a 
and vs degrees 


this expression 8j is an 
^» is the value of the 
of freedom, which has 
he test of оки 18 


o not reach this 


an : 
d retain those which d 
degrees of freed 


esti í 
mate of c 2, with vs 


9 1 ' . 
uu s variance ratio distribution, with v; 
d © area (1-а)" to its left. This decision rule assumes that t 
: Ouble-ended, and this will be assumed in the rest of the paper. Should the 
st of ок; be single-ended, then a in (10) should be replaced by 2a. Tables of 
Q-a)” are given in Rodger (1965). 
3, ALTERNATIVE HYPOTHESES 
An additive alternative to the general null hypothesis given by eqn. (1) is 
22. Gata + di = 4 (11) 


,Ki = cala + йзн2 F 
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and the quadratic form is 
iA n8? [X ag? = o 2А. (12) 
j 


The quantities 8; and A? are non-central parameters. ‘The size of a еа 
effect which will have а type II error-rate (1- В) may be symbolized by 
2 A 
| ев the choice of 8; is somewhat arbitrary (just as the choice of di, 
even d;=0, is arbitrary), as information about an empirical domain accumulates, 
the size of an effect it is thought desirable to detect (êi) can be ever better 
justified. "Тһе smaller 8; is made, (һе larger must z be made to maintain constant 
error-rates of the first and second kind. In the absence of other information, 
a useful and not impractical choice is ôi=0 АУХсу Thus ш =0 would be 
tested against =o, and ш//2-цг//2-0 against ш/у/(2—и)]у/2= е 
which is the same as ш, -Назо,4/2. As more is learned about populations, 


smaller values of 8; may be used, if the scientific value of these smaller amounts 
justifies their cost in time, money and effort. 


Examples of ік; alternatives to the оку given in Section 2 are 
173 Y Ia ia +u — 100 — 26, ; 
1k2= H= 0.4/2; 
1K3—H3 -4 = 04/2; 
14,7 0-5(ш, +u) — 05 (13 +p) = 9. 
These аге the positive alternatives to өкі. The negative alternatives are usually 
taken to be к= — 8. Such Symmetry keeps 1 — for ak; the same as that for 
ix. In this paper the symbol үкү will often be used to refer to both alternatives 


to оку. It must be noted that the above alternatives are expressed in the unknown 
a, units and there is estion that any numerical value will be placed on 


no sugg 
this parameter, 
ples the 8; differ, but each 


In the above exam 
from its oxi. The differences bet 
differences in Ley. The fact that 


way makes the forme 
prefer 


1x is equally discriminable 
Ween the бү, in this case, are entirely. capu 
the size of 6, may be affected by Усу? in t 


; š 2 е 
T a misleading non-central parameter. In this paper W 


M nb? ty c (13) 
j 


derived from eqn. (12). This is 
eqn. (13) to the four above exam 
way of expressing the useful 
make & = 0/5 су, 

2 


1 
adapted from Patnaik (1949). If we Bio 
ples we obtain à? =n for alli, This is ano we 
tule that, in the absence of other information, 


нер с -- 


өт 
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| Should we wish to consider p hypotheses, with coefficient matrix rank p, 
simultaneously, the non-gentral parameter for the alternatives to these, con- 
sidered simultaneously, is 
. Am? тб (pCaaCp") ^ 581 gs (14) 
іп which the elements of pô, are the б, If the p hypotheses are mutually 
orthogonal, then (14) reduces to 

Am? = DAP (15) 


% 
and, should all № = А іп such an orthogonal set, we һауе 
Am? =pr». (16) 
After testing p=a null hypotheses, with coefficient matrix rank p, we may 


define pôi to have zero elements for the ок; accepted, elements Š, for those 
rejected in favour of ixi (because A; is * significantly ” positive) and — ó, for 
those rejected in favour of зк (because ks is * significantly ” negative). Such 
a set of decisions implies values for the py, which are the elements of 

Батар (CaaCp) ^ Ca Ma (17) 
шта being given by eqn. (9). If the hypotheses were p=a-—1 contrasts, 
with coefficient matrix rank p, then the right-hand side of eqn. (17) yields the 
values of шу —p., where p. is the general mean of the a populations. 


Clearly if the rank of pCa is less than p, then pCaaCp’ is singular and its 


inverse does not exist. But a more important reason for insisting that decisions 
be restricted to p hypotheses with coefficient matrix rank p is that this avoids 
For example, in the state- 


redundancy and contradiction among hypotheses. 
ments p —p2=0, ua — pa 70 and p -на-0,р-3 but pCa has rank 2. Each of 
these statements may be deduced from the other two; thus if the first two are 
true, then =u —pa and the third follows by deduction. If one of the three 
contrasts above is said to be false (whether 3; is stated or not), the other two 
cannot both be true. Thus, if the first is said to be false and the other two true 
this is a contradiction because the two true hypotheses imply that pa =з =n 
yet it is claimed for the first that u,zp,. Іп the same way, if we state that 
Ja 7 Ha 8 Ша из 92 and ш –из = ӛз are true, this contains a redundant 
statement when з= ô; + дз and a contradiction when 53% 5, + бу. 

The requirement that à; be expressed іп о, units is, on occasion, too vague 
and we wish to express alternatives to the oi; in the measured units of Y. Stein 
(1945) has shown that this is possible if we adopt a two-stage sampling plan. 
With this procedure we specify о, кь 1 — B, ікі and әк draw n replicates and 
use the resulting data to calculate 62 with v, degrees of freedom. Тһе non- 


central parameter for hypothesis үк will be 
М? = NSP [e Eci’, (18) 
2 


if we use a full N replicates. The symbol $; is used when үкү is expressed in the 
measured units of Y to distinguish it from 8; which is expressed in о, units. 
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Given vy уз, 67, ок, 9; ік; and 1 — B, we may calculate the N required, in the 
light of first stage data, to be 


N= [Ao андарун. 09) 


2 
The square brackets indicate that the fractional part is truncated. If N <n, we 


now test our hypotheses in the usual way, by putting eqn. (4) for L, in eqn. (10), 
otherwise we draw N —z further replicates and put 


Ly=N(ZeyMy + df] og? (20) 
i j 


in сап. (10) with 2,2 and v, being the values found after the first stage of sampling 
and the M; being the means of the N replicates. This procedure does not make 
the power of the tests strictly independent of oè, but it is a slight modification of 
one which does do so (i.e. if N<n in eqn. (19) make N=n+1) and it avoids 
wasting information (Stein, 1945, р. 247). 

The fact that $, involves no nuisan 
but a disadvantage of two-stage samplin 
found after the first stage may be impr: 


ce parameter is an obvious advantage, 
g is that, unless caution is used, the N 
actically large. Тһе size of N depends 
on the size of л, among other things, and it is possible to develop methods for 
optimizing the choice of n (Seelbinder, 1953, and Moshman, 1958). Here we 
note only that the probability is w of finding 


so the probability is w of finding 
6° 2 e, ә (22) 


in which F,_, has degrees of freedom va in the numerator and oo in the 


denominator, Substituting the right-hand sid f i 19), we 
have the probability w that а 
N> Aa уло F, Deyl]. (23) 
j 


This relation may be used to compute the likely upper value of М for any 7, 
given a reasonable choice of of 


Examples of ік 


8/' alternatives to the ox; given in Section 2 are 


Va ++ шз i. —100— 7% 
1K2—[4 -p= 4/245; 
1K977H3 =p = 4/24-5; 
16-05 


А 


Jl 
ok; used a common 0,2, then à 


"JO wv 
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Є š 
Жозе have written of type III errors, e.g. accepting ski when үкү is true, 
such an event will be sufficiently rare to be of no practical importance when 


1— В is small. When өкі is false the sampling distribution of F, in eqn. (10) is a 
non-central variance ratio distribution. The non-central distribution appro- 
s found by Fisher (1928) 


priate to one-stage sampling (8; expressed in с, units) wa 
and studied by Wishart (1932), Tang (1938) and Patnaik (1949). This dis- 
tribution has parameters уз, v, and, for the methods used in this paper, a non- 
central parameter v1A°q-p) "- The non-central distribution which applies in 
two-stage sampling (8 expressed in the measured units of Y) was given by 
Stein (1945, p. 251, eqn. 34). This has parameters v Уз and rA? a-p 

If we symbolize the type II error-rate by 1—£, the area to the left of 
Fq a" in either non-central distribution is (1 — В)” 


4. PRESCRIBED TESTS 


When a —1, we may choose c; and di, independently of the data to be used 


for testing, and test 

ок = qaa + di = 0 against ja = nula + d,= 8, and зк =сша + d= —à 

by using eqns. (4) and (10) with v, — 1 and уа-п-1. The degrees of freedom 
vy are for the mean-square within the sample, which is used as the estimator ó °. 
Тһе sampling distribution of Еріп eqn. (10) is the variance ratio distribution 
with v,—1, v =n—1 and v, X ^ =0 (the central case) when o, is true and 
n=1 xno 2X cj? (the non-central case) when either уку OF әкі is true. 


vig 
J H 

This test is equivalent to that termed “ the one-sample ¢ test " in elementary 

text books. If p independent investigations of this kind are done, each with 

error-rates x and 1 — B, the probability of rejecting r of the null hypotheses 18 the 

(r+ 1)th term in the expansion of (у + о)”, when all null hypotheses are true and 


in the expansion of (9-+ В)” when an alternative hypothesis is always true 


(y=1—a and 0=1- B). 

Frequently the hypot 
populations, with а> 1. Insu 
of the data to be used for testing, 


st in empirical science involve 4 
ch cases we may choose cy; and dy, independently 
and test eqn. (1) against eqn. (11) by using 
eqn. (4) substituted in (10). Тһе sampling distributions of e in еч. m ше 
error-rates and the probabilities of r rejections a ora in 2 = in vea 
the same as those above, except that we wou n е. п 2-4 m 


i G i lysi 
mean: square MS W) as the estimator of , as in апа) З 
= a(n ) ыы Mendes ch is in common use, is the 
2 —1). 


n of this procedure, whi 
so-called “ two-sample 2 test ”, in which a=2, but we test only one hypothesis 
(ок = ше : 

| 1f А mutually orthogonal hypotheses are chosen in advance to be tested by 
using one set of data (n random replicates from each of 42? pen hee 
ifyi enden! 
E the Se etd ны esimators of 
estimators of o, 006 for each hy E 
N 


heses of intere 


pothesis. 
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ithi 8;?), each with 
2 ossible in such data, the a within sample mean-squares (0:2), 

К е will satisfy this requirement. When the data are collected, e 
Буро» will be tested by putting eqn. (4) for L; in eqn. (10), with ó 2 гер a 5 
by Әр, v, 1 and ›„=л—1. The sampling distributions of Fi, the доа нт 
and the probabilities of r rejections of yi; are those given in the first paragraph o 
this section. | 

Since the power (8) of the eqn. (10) test of eqn. (1) against eqn. an 
diminishes rapidly (for small Уз) aS v, decreases and the 6,2 are all estimates 0: 
the same parameter (c), it is desirable to use, in eqn. (10), the single, common 
estimator of o, which is MSW with уз=а(п— 1). The use of a res 
estimator (MSW) may well increase the robustness of the tests and it wi 
simplify the application of two-stage sampling. When MSW is used the tests 
are no longer independent and the distribution of r rejections of gic; is, кин 
no longer Bernoullian binomial (though this is not an unreasonable, ane 
approximation to the distribution). Because ÈZ Li= L, for mutually orthogona 
hypotheses, and the F, 
Іт-УҒ/р is distributed i 
M= Уда (for those ox; which are true, ÀA?—0) When »,—1, then 
Fage 


i-« If it is anticipated that z or more of the ç, will be rejected, 
then Fm will be at least 1p 4,5 so the maximum probability of rejecting 7 07 
more of the oki is the area to the ri i-y< in the aforementioned 
variance ratio distribution 

pCa of rank P, were chosen, they may be 
type I and type II error- i 


1— В, but the maximum Probability of rej 


i 
have a common denominator (1x MSW), then 
n variance ratio form with у=}, Әз-а(п-1) an 


ecting r or more of the о now depends 


; For any given „Са а” 
the different possible selections of у of the oxi May yield different probability 
maxima, We may compute 


1" pCa aC, pt, рт (24) 
hes: : wa pom of r from ?. Тһе elements of pt, are 
су thie or those ғ of t е 

і 


ok; Selected and zero for the others. We define 


F to be the smallest of the (? ) values of eqn. (24). The maximum probability 
of rejecting r or more of the ox; is the area to the right of F in the variance ratio 
distribution with =}, Ye=a(n— 1) and №2=0 when all ок; are true, an 
Àm? given by eqn. (14) when all ік are true, i 

The non-central parameter д? 
numerical integration of the clas 
These values, for »=1x1 
Table 1. A more extensive 
author. 


а-в)” for alternatives to ori can be found BY 
sical non-central variance ratio шн aw a 
to 8, a=] -В-0-05 and Various v, are given he 
table of these quantities can be obtained from t 


| 
| 
| 


” 


Type II Errors and their Decision Basis 195 


TABLE 1. VALUES OF A*q.95», FOR а--0:05 


2 3 4 5 6 7 8 
25-708 26:356 26-980 27:575 
19:390 19-816 20-242 


w n=l 
4 24285 23-963 24420 25-048 
6 10 18227 18:303 18:593 18-974 

E 096 15-934 16:036 16:249 165 °. E 

10 16-101 14978 14699 14-690 14-799 Hen e 1992 

12 15-486 14-299 13:946 13:865 13-906 14-012 14-155 14321 

14 15:074 13:844 13441 13:310 13-303 13:363 13463 13:587 

16 14778 13:518 13:079 12-912 12-870 12:896 12-962 13:056 

14:556 13:274 12:808 12-613 12:544 12:544 12:585 12-653 

20 14383 13:084 12:597 12:381 12:290 12-269 12-290 12:338 

30 13.887 12542 11995 11717 11565 11483 11444 11455 

40 13:652 12285 11:712 11:404 11:223 11:112 11:045 11:006 

50 13:515 12136 11:546 11:222 11:024 10-895 10-811 10:757 

60 13:425 12:039 11:438 11-103 10:894 10-754 10:659 10:594 
оо 12:995 11:572 10:924 10:537 10:275 10-083 9:936 9-818 


orthogonal hypotheses are pre-specified to be tested by 
two-stage sampling, each with v,—1, and р independent estimators of oè 
(say 0:2) are pre-assigned—one to each hypothesis, then the type I error-rate for 
each true gx; tested by substituting eqn. (20) in eqn. (10) will be a. The 

will be the (r + 1)th term 


probability of r type I errors in p true null hypotheses 
in the expansion of (y + a. Stein's modified rule for N ensures that the type 


II error-rate for each true alternative will not exceed 1 — В, hence the maximum 


probability of r or more type II errors in p true alternatives is the sum of the 
first (r +1) terms in the expansion of (6--В)». 


The use of ё in eqn. (19) will generally yield a di 
erally make hypotheses, which were 


hypothesis. Such different №; will gen i 
orthogonal at the first stage, intercorrelated after the second stage. For this 
reason, as well as those given earlier in this section, the use of MSW is 

rejections 


recommended. If this is done the maximum probability of r or more 1 
of ок; is the area to the right of 7р7! B Qin the central or non-central ушш 
ratio distribution with v, =P, v = a(n — 1) and 2,220 when all ox; are true an 


№2 = Уу м2 when ап alternative hypothesis is а In this case the 
i . 
non-central distribution is Stein’s special form. | | 
The distribution of 7 rejections of уф with Р ae eee Le. 

i e-sta à 

hypotheses can be handled in th E apt 
f teaching 


As an example of the арр 
an investigation into the relati 
elementary statistics. The four proce 


(1) the use of a traditional type of textboo! 


(2) the same textbook, b 
(3) the use of a branched 


If p mutually 


ferent N; for each 


lways true. 


bed tests strateg 
‘ferent methods 0 


dures t 
k and lectures, 


ut tutorials in place of lectures, 


-program textbook, and 


(4) the use of a linear-program textbook. 
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Students’ knowledge of statistics will be assessed by a multiple-choice test 
administered after nine weeks of study. We specify 


š e= = 2 

ок = >H=, a= 7 Ha = c, /2, а-а-а ux 

ока =з pa =0, 1K2—p3 7 pa = 0V2, ake Mg" M7 = осу 

окз = t a — ha — p4 = 0, 1Ks—HiHa—]a—Hó4—29, окуш a — a — Ha 
=- 9.65 


and set «—1— f —0-05 for each of these. . . 
If we use MSW with Уз-а(п-1) to test all ок then the various v, in 
alternative hypotheses have AP=n; 


50 we must choose z to satisfy n> в 
Clearly n=14, giving Уаш 


4x 13 = 52, satisfies this requirement. 
Students are now assigned at random to the four treatment groups and, 
after nine weeks of study, theirattainment assessed. It is found that MSW = 10, 
Mj=11, 14, 12, 10, giving L;—63, 28, 31-5 and F,=6:3, 2-8 and 3-5 (j=1, 2, 3, 4; 
i=], 2, 3). Only the first of these reaches Fy 
favour of әу and the remaining ox; retained. А 
_ The values of kj implied by these decisions can be found by inserting 
1837 =(-o,4/2 0 0) in eqn. (17) which gives 
=8.—0.]ү2, рға |2, n, p. 
Тһе maximum probability of y= 
were true, is the area to the right of rp- Bae 
ratio distribution with у=р=3,0,=52 and Am? 
distribution gives 0-143 


+95 74-03; so ку is rejected in 


> 


n. 


=0, which is 0-270 (the binomial 

rejecting r—1 or more of the oxi): 

As an example of two-stage sampling an experiment is designed to investigate 

the effects of a placebo (X. amounts of the drug magnesium 

pemoline (doses 10, 20, 30 and 40 mg/kg being X, to X;) on the discrimination 
learning of rats, peritoneal injection. It is planne 

omponents of trend will be tested. 
f шу (the mean learning score 0 


окт ~pa — He + Hy 29, 0, їкү= —2y, — = 20. 
oK3—72u4 -p — 2ps =p + 2ш-0, ies = Жыны. 5 ыды, 2% а 4/560, 
окз= “Ht 293 25, - i, — 0, 163= Hy + Qty — 2p, +05= 20, 
ока = ш — Apa + bpa -4ш %ш-0, 1&4 = pa — 4p, + б a dun -104/28, 
21 = — 2u — ug +ра+ 2us= —20, 
2K2= 2p, “H2 —2us “Ha + 2us= — 4/560, 
2Kg— THa + ha —2u, +h, = sep. 


sa Pa рабаз E T 
We set «=1—В=(0-05. Wu T 


s 
Table 1 are equivalent to n=2 by steps of 0:5 to 6, 8.5, 11, 13:5, 16 and оо. АШ 
uw 


“ 


| 
| 


- 


P. 
| 
| 
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When v, — 1, Stein’s non-central parameter has the value 
Лав = (һе + ГА (25) 


in which ż is Student's statistic with v, degrees of freedom. 

If we used n=9, then v,—a(n—1)—40 for MSW and 2%). s =(2:021 + 
1:684)? —13-727. We might have some preliminary evidence that MSW is not 
likelytoexceed 50. Since № —40N/MSW foralltheabovehypotheses, then from 
eqn. (19) we obtain N = [13-727 x 50/40]--1—18. If we insert о,2=50 іп eqn. 
(23) with w = 0-05, the probability is 0-05 that N> [13-727 x 50 x 1-69/40] + 1 —29. 
These N's are considered practicable; so л —9 is chosen for the first 
stage of sampling. Note that if n were increased then, all other things equal, the 
expected value of N would decrease, but there is a risk of overshooting the mark 
and finding N <n at the end of the first stage. Increasing z too far would be 


uneconomic. 

After the first stage of sampling an=45 animals, the experiment is run and 
the value MSW —35 obtained from an analysis of the learning scores. From 
eqn. (19), N=[13-727 x 35/40] + 1=13 is computed. Four more replicates are 
now drawn for each of the a=5 treatments, these animals treated in exactly the 
Same way as their predecessors and the observed mean learning scores "e the 
N —13 replicates from each population were М;-205, 22, 2219, 23, 27%, eqn. 
(20) gives 1,--292-5, 23:2, 32:5, 3-79 and eqn. (10) gives F, — 8:36, 0:66, 093 ni 
011(j21,2,3, 4 5 i=l, 2, 3, 4). Only the first of these is larger than 
Куе =4-08; so ок 18 rejected in favour of уку and the other gx; retained. 

The values of p; implied by these decisions can 


be found by inserting 
8,7 = (20 0 0 0) in eqn. (17), which gives 


-2, ps + pA. 


мтет d | 
errors if all оке Were true is the 


The mast ‘lity of r=1 or more type I ó + dieributi 
area es right ener N =1x<408/4=1:02 in the variance ratio distribution 
is 


' ісһ 15 0: inomial distribution 
a y=p=4, 9-40 and Am?=9; which is 0 409 (the bino 
Bives 0-185). 


- It should be noted that 
ejected may be much larger t : i 
Ce ° i Ct ok: 
tion of oj The minimum required to ICI 91 
Wo-stage sampling il tion is 
npling illustratio 1 
: i uni 
The difference between the minimum amo 
among other 


amount accepted depends on 1-В, 
his difference Бес 


, 


thesis when ок is 
hypo ary for 


thing. + 
he “ price 


fro É 
m а truncati Analogous Ге c, in д 
ation factor. he unknown б, 
Case, b ; he presence of t he M; in the 
» but th licated by the P аңабоп among tHe ^7 
Vues that there is enough variation Had we chosen à 


It is interesting to note 


One.. 2 М 
Stage sampling example to reject t 
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different set of hypotheses, we might have found L 740-8, L,—40-8 тт 
L,=40-9. These account for the same М; variation as the 14 we found, es 
SSB=XL,= 122.5. Our hypothetical set of Li give F;=4-08, 4-08, 4-09 an 


all three of the null hypotheses would have been rejected. In the eer 
sampling illustration the variation among the M; could support the нем 
two appropriate x; chosen in advance. The choice of oki that was actual ly R 
has over-concentrated “ significance ” in оку and there is a sense in which t 

choice was unwise. In the formalities of decision, making a large difference 
between F; and Ға) ^ has the same effect as a small difference and, for iin 
reason, such large differences waste information. This is the stronges 


5. UNRESTRICTED TESTS 

In this section We assume all 2,0), 
specifying values of mj and Carrying out 
methods of this section—the Y; being the 
An infinite number of different sets o 


rank p, are possible for @ population means. Here we elaborate a method 
given by Rodger (1967) for testin 


E P hypotheses, the choice of which is in- 
fluenced by the observed Mi. 


testing hypotheses Suggested by the q 
statistical criticism Which claims that 
sampling error. We take this to mean that the w and 1— В claimed are false, the 
former, at least, bei 

seek gx; to reject. 
since the actual €rror-rates (or the expe 


are the « and 1-8 Pre-specified. "Тһе Second criticis 
that a set of Statements (decisions) wit 
scientific value, But w i 
М; it is not inevitable that t 
the research Worker has а ionale for more than one set of p hypotheses 
with »Cq of rank P and he has diff i 

methods of Section 4. 
empirical science, 


This does not prevent us from pre- 
an analysis of M;= Y;+mj by the 
observed sample means. . 

f p null hypotheses, with pCa having 


little rationale, 


d ich 
„р, 10 restrictions were Placed on the kind and number of hypotheses wbic’ 
might be chosen Post hoc for formal decision making, it would be possible на 
state only very crude bounds for the error-rates x and 1-8. We shall adopt t 


< 


” 


| 
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- that as many ок; as possible be rejected and, initially, restrict the decisions 
о be made to p mutually orthogonal hypotheses. If these gx; have at least one 
У су 0, we make р =a, otherwise, for contrasts only, we make 5 —a — 1. 


Consider independent tests of ,A;—0 with a decision rule which ensures 
that а and 1—f are the error-rates for ок; and ук; respectively. Under these 
conditions the probability of retaining all оку is (1 — о)” when all к; are true and 
(1— В)” when all ік; are true. The retention of all өк; implies the retention of 
oAm=0. The probability of retaining „Л = 0 when it is true will be (1— o)? if 


we reject it when 
Fn =Lm|vy,MSW > Fu a (26) 
with v, =р. But when the L; are derived from mutually orthogonal hypotheses 
6.2, then 


and the Г; in eqn. (10) uses MSW for 6, 

Fm =D LI MSW => Fi. 

The use of a common error-variance estimate is advisable for the reasons given 

for prescribed tests in Section 4, but its use makes the tests no longer independent. 

The use of Li, chosen post hoc, in eqn. (10) with ¢2=MSW and v, =) is 

the R technique introduced by Rodger (1965 and 1967). We maximize the 
number of ок; rejected when the number rejected is 

q— [FnlFa.). (28) 

the square brackets indicating truncation of the fractional part. If eqn. (28) 


(27) 


yields g>v,, we make q—»,. 

The probability of rejecting r null hypotheses is the area between rFa-a” 
and (r+1)Fq_,)" in the variance ratio distribution with v, =f, v» =a(n — 1) and 
Аһ? —0 when all ок; are true and Ag? — XA? when some of the p alternatives аге 


- i babilities are 
true (for r=v, we make (r + 1)Fa 9^7 оо). If the resulting pro i 
symbolized by В, Pay ov sy Бы the expected proportion of type I errors in 
D =v; true ку tested in a single application of the R method is 
(29) 


E(a)= ХР 


indicates опе of the effects of non- 


е іші ДІМ eing the form of distribution of 7 


The discrepancy between b 
Independence of the tests (the other effect hal. integration 80 far 
rejections of ше. In the cases investigated БУ тіле om to Bla) —B(a) 
¥1=2 to 20, vy = to © and а-0:05), = !Š 8 8 ite satisfactory for 


—certainly qui 
Tanging from 0-078 to 0:033 when у <= certainly 4 


Practical purposes. š 
If the alternatives to all p pe ria A 
Probability will be (1 — 8)” of retains 0 20 
the left of Ta аг (1—8)" Bi Шы ек агеав 
Ye=a(n— 1 2-у My)" E T=v. 
Ginn E v^ T 1. 2 pt and (r+1)Fa-9"= 9 when E 
1-а)" =U, jy s + 


and all кі are true, the 
) when the area to 
tio distribution with =P 


between rF(.. ant 
)in this 


qual value of А 
=0 (hence all өкі 
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d 
istributi iliti jecti he өкі when all үкү are true ап 
distribution are the probabilities of rejecting r of the grg dg 
ine may be symbolized by Oy Os... Q,. Тһе expected proportion 


, $ ‹ 
rejections of џк; in a single application of the R technique in this case is 


FB) т0н ен 


Numerical integration of the distributions with the Ур va and A; given in 
Table 1 yield E(B) ranging from 0-984 to 0-937 (B — 0-95). 
We may choose F. in place of FQ y» as 


e ize 
à critical value and symboliz 
the successive areas between TF, and (r+ DF. i 


n the variance ratio distribution 


with v=, v,—a(n— 1) and 2,2-0 bY то, mo... т,. An appropriate choice 
of F. will yield 
E(a) = > "туу = a. (31) 
т=0 
Тһе Corresponding areas in the variance ratio distribution with "=P; va = а(п P 
and Am? =r, S, may be symbolized as Pos 4, ..., ¢, An appropriate choi 
of A, 2 will yield 


L^ 


E(B) - Ў р.р, (8%) 


r=0 

Values of F, and A,” that Satisfy eqns. (31) and (32) for popular values of « and 
В are currently being sought, 

Stein (1945) discussed the use of hi 


à iance 
š š SPecial form of non-central varian 
ratio distribution when "i21 and Healy 


d 
(1956) showed how it could be use 
r Simultaneous estima 


design experiments for two-stage sampling 
and tested by ће R technique. We draw n r 


а to estimate g 2 by MSW with va=a(n—1). We then draw 


2 
N= О%Мв8Уу/5%) +1, pa 
in which 57 is the value of each alter 
assumed to have the 


. аге 
Native to normalized hypotheses (all ук+ 
not been numerical] 


Н istribution 
a ortunately, not only has Stein’s distribu 
Y Integrated to find valu 


Е a 
to set the variable to the “ apparent ” or the projective size of the stand 
The various conditi 


ons are shown in Table 2; 
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TABLE 2. CONDITIONS OF HYPOTHETICAL EXPERIMENT ON SIZE CONSTANCY 


Group 
1 2 3 4 5 6 
Viewing Reduced Reduced Full Full Full Full 
Angle 5° 60° 5° 60° 52 60% 


Instructions Apparent Apparent  Projective Ргојесіуе Apparent Apparent 


We are interested not only in differences between treatment populations but 
also in the extent to which the means of populations approach projective equality 
to the standard (which will happen when the variable is set at 10 cm); so we use 
a decision set with »,=p=6. We set a=1—f=0-05 and A?—z. Since we 
shall use 6,?— MSW іп eqn. (10) then v, = (п — 1) and the various уҙ in Table 1 
represent n=1§ x } to 44, 6, 78, 94, 11 ап4 о. If wechoosez = 11, then v, =60 
апа 2%, o |6=10:754<2. Eleven subjects are therefore assigned, at random, to 
each of the six treatment groups, the experiment is carried out and we observe 
MSW —25, Ӱ;= 11, 12, 12, 13, 14 and 15 ст. We make ту- —10 cm (to 
adjust for projective equality) and define М; = Y;+mj=1, 2, 2, 3, 4 and 5 cm. 

These data yield 


USSB = Y; M;* = 649 
j 
SSB -2YX (M; — М.) — 119-16. 
j 


When divided by v, MSW = 6 x 25, these quantities yield 4:33 and 0-79. Since 
Е,.550= 1:31 we may reject 


q- [433/131] - [5:3] 23 


null hypotheses. Since |0-79/1-31|--0, we shall not be able to find any contrasts 
that can be rejected by the eqn. (10) decision rule, After a few trials of eqn. (10) 
with the M; we settle, on the following mutually orthogonal hypotheses as our 


decision set: 

ofa a tHe d a 0, L,-9L6,  F,-061, Retain ya; 
І,-2695, Е,= 1:80, Reject grz; 
ока == 0, І,-2750, Е; = 1:83, Reject xs; 
"EU L= 55, Fy=0-04, Retain ока; 
оку= 2 hatu =0, Ly 73, Fs=0-05, Retain ocs; 
L = 0, F,=0, Retain pg. 


the above decision set gives 


око = +s = 0, 


оке 7 Ita 7 Ha = 0, 
When X-za we have 8j-o,VXc; so 
8,20, c,4/2, o, 0, 0, 0. When these quantities are inserted in eqn. (17) we 
obtain ш-0 0, 0, odv c | /2, Oe 
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In spite of the fact that we could not directly reject 2ш t2 is ты а 
3рь=0, this hypothesis is false by implication since its value is | + ШҚ 
3e,|4/2—3o,|4/2— -6,4/18. The most plausible Teason why this пуро on 
could not be directly rejected is that its 3— — 7,4/18 gives А2 = 0-62 and we wu 
using \?=n: our test did not have enough Power to detect directly the fact 

i is is false. қ 
k Се pr rejected as many hypotheses as the variation among the и, 
justifies. This was because we could not find three mutually MU aa 
hypotheses which had a simple interpretation and could be rejected. А con 
Servative decision set will commonly occur in the application of the R technique 


with Р;> 1-31. One interesting set is that with q values of Fi=Fm/q (+33 [3 in 
our example) and the remaining p — values of F;—0. Such a set is uniquely 
determined by the rules that it contain mutually orthogonal hypotheses with q 
of them having Fi=F,,/q and the remainder F,=0. After much tedious 
algebra it may be shown that this set yields 


=o. MyV(go._»»/USSB), (34) 
when its 8; and pCa are inserted in eqn. (17), 


If the decision set is restricted to 
contrasts, (34) becomes 


н-һ-о(М)-М)у(өде yn [SSB). (35) 


In our example, eqn, (34) gives у =0:226o_M;. 

If all к; were true, the Probability of rejecting r=3 ог more of them is the 
area to the right of ур ‘31 in the Variance ratio distribution with 
11 =P =6, v;—a(n—1)— 60 and )n* —0, which is 0002. If ¢=3 of the alternative 
hypotheses were true and the remaining <; true, the Probability of rejecting 
r-2 or fewer o is the area to the left of (r+ ШЙ, чыйк 1:31 іп the variance 
which je (194 са а анан Ds and М2 д8 gyn —=3 x 11, 


There can be no Serious objection to using a set 


ch have pCa with rank 
Бола! basis for the R method 
by eqn. (28), of the oxi and the no 


accepted is (12, 
earlier. We will assume that 


of hypotheses which are not 
Р, as the formal decision set. 


oxi are those given 
malized, since this simplifies the 
14% accepted. The 5; must satisfy 
ЕСТІ у. 
qÀ a-9^ —m8,"( C. аС,?) bo sr, 2910,72, (36) 
If we accept the earlier view th 


t at all A? be 
being equal in eqn. (36) fo 


made equal, this is equivalent to all à; 
oye accepted 


© Of course for oxi accepted 5,=0. 
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With these conditions we find the solution from eqn. (36) to be 
8— о, (qv [nu) (37) 


in which и is the sum of the elements of the sub-matrix of i 

when the rows and columns of pUp, corresponding to x; s 
Whereas with mutually orthogonal null hypotheses it is impossible to find 

more than q (28) which are rejectable by the (10) rule, this s possible with null 

hypotheses which are intercorrelated. We ought to modify the Ру os 

criterion for уку which are intercorrelated. This could be done by adapting (2), 


but it seems an unnecessary refinement for practical application. 


6. SUMMARY 

e variate Y is distributed normally, with 
means uj (/-1,2,....2) and common variance a, the decision rule proposed 
in this paper for linear statistical hypotheses about the pj is to reject null 
hypotheses for which F,=Li [MSW > Еи" When p null hypotheses, 
with pCa of rank p, are chosen, independently of the data to be used for testing 
them, y, = 1. If theselection of hypotheses is influenced by the test data, v =? and 


we may reject -ПҒа/Ға-а%159 of them. 
It is recommended that the sample size (п) be chos 


hypotheses from reasonable alternatives with known р 


Don-centra] parameters which may be used to assist the choice z are available 
mpling is used, alternatives to null 


from the author. When one-stage sar J 
hypotheses must be expressed in o, units, but with two-stage sampling these 
alternatives may be expressed in the measured units of Y. Tables of non- 
central parameters appropriate to two-stage sampling are available, as yet, only 


for v, =] 
171. 
ЖТ ; nly allow 
Тһе methods given in this paper for testing null hypotheses not on y. 
us to anticipate us type I error-rate and power (or the expected proportion of 


rejections of null hypotheses) given statements of what is true 5 uc dan 
they also allow us to calculate the probability 0 ° 7 


(O<r<v). 


Given a populations in which th 


en to discriminate null 
robability. ‘Tables of 


f r rejections 
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FACTOR INTERACTION IN NONLINEAR FACTOR ANALYSIS 


By RODERICK P. McDoNALD 


University of New England, N.S.W., Australia 


Factor interaction models are defined as those cases in nonlinear factor 
analysis in which the specification equation contains products of two or more 
latent variables or functions of latent variables. А complete algebraic treatment 
is given for the case of a product of two latent variables, and certain more general 
cases are briefly outlined. 

1. INTRODUCTION 
, 1965 b, 1967 a, 1967 b, 1967 c) has developed a 
hich allows for nonlinear, possibly nonmonotonic, 
d variables and one or more latent variables or 


McDonald (1962, 1965 а 
treatment of factor analysis w 
relations between the observe 
factors. 

Bartlett (1953), in a discussion of the linearity hypothesis in factor analysis, 
noted that if we include the product of two existing factors 25 20 additional 
term in the specification equation of linear factor analysis, it will act just like a 

the correlation properties of the observed 


further common factor as far as f t 
variables are concerned. He further suggested that to discriminate such a 
product term from a genuine third factor, it might be possible to examine the 
correlation between the suspect term and the constructed product of its posse 
component factors. Bartlett noted also, however, that such an inquiry м ; 
be complicated by rotational indeterminacy, especially in the presence o! 0 he 
genuine factors. the factor model which 

roduct of functions 
r is to present an 


Let us define а fa в a term in Í 
contains the product of two or mo mper the P 
of two or more latent variables. ect of t ін Бары 7 bo paredes 
extension of the theory given b McDonald (1962, 1 і d deos pm 
certain cases involving factor interaction. n жосық дее к 
simple case described by Bartlett, followed bee e 

theory, and a constructed numerical example of the 5! р 


ctor interaction a 
re latent V 


2. THE SIMPLE INTERACTION MODEL m и 
і in which one 
ill be considered in w 
cape a atent variables. 
өзінді — can be represented as a product of two l 
onal com 
н" жаза іы (1) 
= aj: охо +818127 O M 
gj ani Y 2 x 2) are the model parameters, and zp X 2 6 
cared is a standardized observable variable, 
dized mutually 


where a ап 

4p 4j» ајз h 2; 
2 9 r, each 5j 

аге random variables жайты» n; Xp are standar 


ie., E(z;)=0, B(a)= 1, jah, TI 


Xs 
а 
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statistically independent latent variables; each еј is a random 152 Ін 
Е(е)-0; and the ej /-1,2,...,п, аге statistically independent of each o 
“ сах the strong assumption that the latent variables хі, x, are mutually 
statistically independent, the three components Xy Xg and xX are indeed 
orthogonal, since 

E(x; . 55) = E(x,2)E(x2) =0 
and 

Е(х. 2x3) =E(x,)E(x,*) =0. 
Also, the component xx, is normalized, since 


E((,29)?) = (ау (02) = 1. 
Model (1) may be contrasted with an alternative 
2j — арх + ах» + 4454 + еу, (2) 
with analogous notation, where each pair of the latent variables Xis Хо, Хз IS 
statistically independent. 
Both model (1) and model (2) are consistent with the weaker model of 
orthogonal factor analysis 
24 = бушу + уо + bju + ej, (3) 
where it is specified only that 
E(vpvq) =1, for all p=q 
=0, for all 5 q 
and 
E(v5ej) —0, for all 2,j, but v, ә;, Us 
independent of each other, 


Two n x 3 matrices of factor loadings A and B are now defined whose rows 
are given respectively by (ал, 


ағ» а] and [bj by, Б). "These matrices are 
assumed to be of full column rank. There exists a 


are not necessarily statistically 


3 х3 orthogonal matrix 1, 
such that ни 
A=BL, (4) 

and, correspondingly, for model (1) 

[x » Xo, Xx, | = » Uo, ў (5) 
while for model (2) шыны 

[хь хь x] = |р, vs, ој, (6) 
That is, supposing that 


given population, then it may 
or (2) applies to that population. 
the factor loadings is given by 
determined uniquely except for 
int moments of the orthogona 
е second. Under the additional 
2 Vanish, simple relations can be 
ions in the plane. 


f the stronger models (1) 
; the relation between 

applies, the matrix 1, is 

reflections by certain jo 
of higher order than th 
d moments of X, and x 
alytic solution for rotat 


eqn. (4). If model (1) 
trivial permutations or 
components (о), о, vs], 
assumption that the thir 
obtained yielding an an 
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rmation L corresponds to a single 


If, in particular, the orthogonal transfo 
it is easily shown that 


rotation of [v,, 9) through an angle of 2, 


- cos 20,, = E(v,?3?3) (7) 
Thus, from addi sin E азы; 
[Py ?» НЕМЕЛЕР xx] | бө Sie 0] 
| зан ба 0 | (8) 
б ООП 
where су» = соз бу, зуе 7 Sin 15 it follows that 
E(v;v53) = 41 2 E(x) 75 32 Е(ху2хз2) 
(9) 


= cos 2012 
ressions analogous to eqn- (7) can be derived for planar rotations of 


Exp 
[v 931 апа [vs, vs) viz., 
E(v,?373) = cos 20,4 — cos біз sin 0,4 E(*2°) 
=cos 201 (10) 
under the assumption that 
Е) -0 
and 
Е(,9:93) = 0$ 20,4 — cos без sin баз E(*1°) 
= cos 2023, (1) 
under the assumption that 
Efe?) =0. 
h corresponds 


ast-squares procedure can be developed whic 


Alternatively, ale 
McDonald (1962). 


to the theory given by 
Writing 
[n* vs*, ыы [61201 7 51302» 54901 t 1202 vs] (12) 
0, is chosen to minimize the function 
Ф- E((vs* = АЛЫУ) ы T 
ponen ci ) “ко 
i i i ion, 
It may be noted that if the model Б рй t oa ers 7 
i insi is i ically zer ху Хә ( 
E gom га €: i He ence, the minimum value of ® will be 
osen such that 9: ="? 2 mots 
(14) 


opulation. 
E(vs) = 


precisely zero in such à 
#2 
E(x"? 22”) 


whence жо”) 
-2-2Е(а ir 2—51,2)010208), (15) 


o 
-2 _2E(a ena ( or 95 ооз) t (018 
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so that the function Ф will be a minimum w 
E(v;*v;*o,*) is a maximum. 

Setting the partial derivative of with respect to 0 
eqn. (15), it is found that 


Ф.Х. 2, 
tan 20,,- Е(о,2оз) Е(ә,%;) 


hen the trivariate moment 


1» equal to zero in 


16 
2E(v,v,v,) (6) 
and by symmetry, for single rotations of [21, v], [va 23] through angles 05, бъз 
respectively, 
tan 20,, 2092202) - E(ojto,) (17) 
2E(v,v,v;) 
and 


E(v,?v,) - E(v,"0,) 
= eas 18 
tan 2605 2E(v, е) š (18) 


| Pared with eqns. (7), (10) and (11). Тһе 
former equations, together with the Conditions for a minimum, determine the 


his restriction of range corresponds 
rminacy of L noted above. 


1 and x, through an angle of 45°, model (1) can be 


LN "E „ж cT X 
S=, EA le ag ^: ) (x, х%2) 


in this €quation, the 
nction in x,* and 


model may then be expressed as the 
might appear to be transformable i 


а quadratic function in ху, so that it 
: the class of polynomial 
г papers, However, it is a degenerate case 
n the rearranged €quation are not linearly 

“and x,9? differ only in sign.) Eqn. (19) 
à product term ух, is not an invariant 
Property of the model, so that an inte а of the action of one 
I (cf. Saunders, 1956) is not entailed 
Properties, However 


» the degenerat lynomial case 
represented by eqn, : в € poly 
original form of the i certainly appear less parsimonious than the 


3. More GENERAL Casrs 
S now Considered: 
a= ах + ах 


Тһе following model i 


2 ajawa + ajaa + ej, Q0) 


orthogonal component has been 


ү + а latent Variable x, which is assumed 
las 
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Again comparing the model (20) with a weaker model 


М 2j = Бро + bjsgs + 0508 + Оа ep 
whose components 21, 9 © 
p Voy Ug, V4 ALE mutually orthogonal, it is evi 
ia 5 | t gonal, it is evide 
sagen Ө Section 2 will hold for simple rotations of т with v 2 — th P. 
= a with vg. . In addition, it is easy to develop a least salare s uk 
zc Pm is the above, for simple rotations of v, with v4, va with " ces 
H + 2 
од. Thus for the case of a single rotation of [o v4], it is sedile io’ was 
e 
à =2—2E(2,*02*05*) 
-2-2Е(сц9- 5432s)? 2s) 
= 2 —2¢4E (0192s) + 24E(v 20304). 
respect to 04 equal to zero yields 


(21) 


(22) 


Setting the partial derivative of Ф with 
E(vsv374) 
tan 0,7 — miu. 

Е Е(о,9:94) e) 


which, together with the condition for a minimum, determines 0,4 uniquely. 


By symmetry, 
E(v,v3?4) 
tan 0,,— — кий эё эй 
кш Еа) (24) 


E(v,2 2) 
баз- DUM be 
tan би — озо) @5) 


rmation obtained by maximizing the prescribed 
to an arbitrary permutation of the 
he left in eqn. (5), say.. 
n on a finite sample of factor scores, 


mine which column represents the product term. In the 
in the component variates due to unique variations (a 
btain in practice; see McDonald, 1962) this decision 


situation which does not o 

is easily made by inspection of the transformed matrix in graphical or tabular 
nce of such ‘ errors ^, a number of asymmetries in the 

ts could be employed as a numerical 


the transfo 


determined only up 
n by the expression on t 


As noted earlier, 


trivariate moment is 


nts of the model as give 


eleme 
ven such a transformatio: 


Suppose that, gi 
it is desired to deter: 
absence of ‘errors’ 


form. Also, in the abse 
magnitudes of the joint or univariate momen 
n of the product term. For example, it is immediately 
the product of the 


oduct term equals 
ver, the effects on such quantities of 


no strictly unequivoc 


criterion for identificatio 
the fourth moment of the pr! 


obvious that 
fourth moments of its arguments. Howe 
i iance are quite complex, 
(24) and (25) have 


jon has yet been found. 16, ал, (18) (3) 
. (16), (17), Eh Donald, 1965 b, 1967 a) 


FAINT (McDonald, 

s to rotate à sample matrix of factor 

me group of three successive 
S.P. 


* errors 


decision criter! 
The relations SUP 


been combined in a computer 


to form an itera 
scores, so that the sam 


о 
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columns of N observations on v, 


1/8 
My = N ( > Vi,kUL kii, kpo ) (26) 


i= 


will be a maximum or, what is the same thing, the sample analogue of Ф as given 
by eqn. (13) will be a minimum. In practice, it is recommended that sample 
component variates as defined by McDonald (1962) be employed. It is easily 
shown that the relevant sample moments of these are unbiased estin 


nates of the 
corresponding moments of the ‘true’ factor scores. If the data conform to a 
, model such as 


817 Ay + apx Барлы аду аху арх... е), (27) 
a rotation will be applied to the estimated fact 
the prescribed relations between the columns of the matrix to be detected. The 
rotation obtained by maximizing 7 can then be applied to the factor loadings 
to complete the estimation of the model parameters. 

The program, at any iteration, uses relations of 
(17) and (18), respectively, to rotate columns k with 
k+1 with k+2, and uses relations 
rotate each of the columns k, 
score matrix. 


ог score matrix which enables 


the type of eqns. (16), ` 
k+1, k with k+2, and „ 
of the type of eqns. (23), (24) and (25) to 
k+1 and k+2 with each other column of the factor 
This procedure is iterated until no rotation of pairs is indicated 
to be necessary, of greater than a size prescribed in degrees by the user. 

The above remarks generalize readily to the model 


Z=... Жауы а, қыла EAJ ep psa HA) ey SENE Xp a + --ер (28) 
and the like, іп which triple products or higher order products of latent variables 

Àn important restriction on these 
models is that each of the factors making up the Product must also be present 
he model. Typically, if the model 


I т... із (ор +++ Uk um). (29) 
Finally models of the following type are considered: 
where Toda imb) + атах) + aps see) +...ep (30) 
Езу) — 1, р = 
=0, рад. 
That is, in place of н 


orthonormal functions 
analysis, without inte 


Up X»... there are a number of 


A treatment of nonlinear factor 
г f orthonormal polynomials, has 
been given by McDonald (1962 19 9 poly , 

be noted immediate} ( заь 1967 9). 


: а irectl 
to model (30), provided that each Руш. plied to model (27) will apply direc ly 
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also present іп the model as a single orthogonal component. А general computer 
program for nonlinear factor analysis, which includes optional specification of 
models of the type of eqns. (28) and (30), without any restrictions as above on 
the univariate moments of the latent variables, is currently under development. 


4. А CONSTRUCTED NUMERICAL EXAMPLE 


In this section a numerical example corresponding to model (20) is given. 

For this and other applications, a general test program, ARTY, has been 

. written to generate a matrix Н of factor scores, which will have the properties 
prescribed by a specific nonlinear factor model. The number of columns 
(factors) z and the number of rows (cases) 2 >r are prescribed, and the elements 
of each column of the matrix are generated, either as sample values of a normally 
distributed random variable, or as values of a constructed polynomial of 
prescribed degree in a column previously generated orthonormal in the sample, 
or as a product of two or more previously generated columns. The program 
then generates a random orthonormal r x r matrix L, and finally postmultiplies 
H by L, yielding an пху matrix V of randomly rotated factor scores. These 
are punched on cards in a format suitable as input to the nonlinear factor 
analysis programs (McDonald, 1965 b, 1967 a). 

A 50x4 matrix, generated by program ARTY, is shown in Table 1. 
This matrix was constructed as a random rotation of a matrix corresponding to 
the model (ху, xs, x35, хз). А fifth column on the right of the matrix contains 
the constructed product of corresponding elements in the first and second 
columns of the matrix. The correlation between this constructed product and 
column three of the matrix is — 0-284. 


TABLE 1. Input FACTOR SCORES AND CONSTRUCTED PRODUCT 
or First AND SECOND FACTOR SCORE 


I II III IV IxII 
1 —425 2-98 1:01 3:77 -12:69 
2 —031 1:70 -147 1:22 -0:55 
3 0:54 0-74 -0:26 -0:89 0-41 
. 4 0-15 — 0-60 0:67 0:52 —0-09 
5 0-00 —0:05 1:06 — 0:29 0:00 
6 0:63 0:85 -061 - 0:76 0:55 
7 0:18 0-40 0:65 -0:85 0:08 
8 0:67 0:92 -0:82 -0-65 0-63 
9 —0-79 --1574 —243 0:07 1:38 
10 0:57 1-22 -144 0:01 0-70 
11 -0:78 --1%75 —243 0:08 1:37 
12 0:00 —014 1:05 —0:16 0:00 
13 0:30 0:52 0:38 —0:93 0-16 
14 0:53 —1-40 -0:87 1:34 -0-74 


(Continued) 
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TABLE 1 (continued) 


I п ш IV ІхП 
15 0:15 -0:62 0-66 0:54 -0:10 
16 0-26 -0:82 0-36 0-81 -0:22 
17 0:15 1-49 —1:58 0-70 0-22 
18 0-03 —0:32 0-96 0:11 -0:01 
19 0:34 0-57 0-26 — 0:94 0:20 
20 0:30 0:52 0:38 —0:93 0:16 
21 —1:20 - 1-70 -2:08 -0:31 2:06 
22 0-00 -0:19 1:04 -0:08 0:00 
23 0:00 -0%01 1:06 -0:35 0:00 
24 0:00 -0-02 1:06 —0:33 0:00 
25 0:20 —071 0:52 0-67 —0415 
26 0-04 —0:33 0:96 0:12 -0:01 
27 0:09 —0-48 0:82 0:34 - 0:04 
28 0-66 0-89 -0-74 - 0:70 0-59 
29 0:01 0:09 1:02 —0:51 0-00 
30 0:51 —1:54 -124 1:34 -0:81 
31 0-07 0-24 0:89 -0:70 0:02 
32 0:63 115 —1:35 -0:15 0-74 
33 044 1:33 -1:54 0-28 0:59 
34 0:56 0-77 -0:34 -0:86 0:44 
35 0:33 -0:94 0:14 0:95 —031 
36 0:07 -043 0:87 0-27 — 0:03 
37 0:38 —1:03 —0-03 1:06 -041 
38 0:05 -0:36 0:93 0:17 -0%01 
39 0:03 - 0:30 0-98 0-08 -0:01 
40 0-29 0-51 0-40 -092 0:15 
4 0:37 -101 0-00 1:03 —0-38 
42 0-14 0-34 0-75 —0:81 0-07 
43 0:60 1:19 —1-41 —0-05 0-72 
44 0:26 0-48 0-48 —0-91 0-13 
45 0:19 — 0-69 0-56 0:63 0-13 
46 0:66 0:90 -0-77 —0:68 0:61 
47 041 -1:09 —0-14. 111 -045 
p. 0-18 —0-66 0-59 0-60 —0412 
ы — ш NE NP NA- 
Site -062 —3-48 5-03 


. 3 i able 1 was input to program FAINT, 
with a prescribed Precision of two degrees That is, the criterion for convergence 
other through an angle greater than 
Ctor score matrix, again with a fifth 
ining Tucted product of elements from the 
first and second columns, ‘a E in Table 2. The correlation between this 
tructed | umn three of th ixi . i 
rotation is given in Table 3, шаны "de ms 
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Тавів 2. ROTATED FACTOR SCORES AND CONSTRUCTED PRODUCT 
or First AND SECOND FACTOR SCORES 


T II III IV IxII 
1 —3-23 3-14 -4-67 0-32 —1044 
2 -023 1:94 —0:46 1:63 -045 
3 1:09 0:57 0-40 —045 0-62 
4 —0-84 -043 0-21 -042 0:36 
5 —040 —0-15 —0:07 —1-08 0-02 
6 1-12 0:73 051 -. 019 0-82 
7 0-66 0-16 0-05 —0:94 ` 0:11 
8 141 0:84 0-55 0-42 0:93 
9 0:20 -1-70 -0-33 2:27 —0:34 
10 0:77 134 0-46 1:19 1-03 
11 0:19 —1:70 -0:32 2:28 —0:32 
12 —0-23 —0:20 —0:05 —1:03 0-05 
13 0:85 0-29 0-16 -0-73 0-25 
14 -124 -0:88 0:84 1:33 1:09 
15 -0:85 —044 0:33 —0:40 0:38 
16 —1:04 —0:54 0-39 —0-02 0:56 
17 0-24 1:67 0:03 1:54 0-40 
18 —0:50 —0:30 0:02 —0:84 0:15 
19 0:91 0:34 0:21 —0-63 0:31 
20 0:85 0:29 0:16 -073 0:25 
21 054” —1:83 —0:76 244 —0:99 
22 —0:31 —0:23 —0-03 —0:98 0:07 
23 —0-03 -0413 -0:08 -1410 0:00 
24 —0:05 —044 —0:07 —1410 0-00 
25 —0:95 —0:49 0-30 -0-22 0-47 
26 -051 -0:30 0-02 —0:83 0-15 
27 —0-69 -0:37 0:12 —0:63 0:26 
28 1:12 0:79 0:53 0:32 0:88 
29 0:14 —0:06 — 0:08 —1413 —0:01 
30 —1415 —1:00 0:88 1:70 115 
31 040 0:03 —0:04 -1:%09 0:01 
32 0:88 119 0-52 1:04 1:05 
33 0-58 1-45 0:32 1:36 084 
34 140 0:60 0:43 —0-08 0-66 
35 —143 —0:60 0-49 0:22 0:68 
36 —0-63 -0-35 0-08 —0-70 0:22 
37 -148 -0%65 057 043 0:77 
38 -0:55 —0:32 0-04 —0-79 0:18 
19 -047 —0:29 0-01 — 0:86 0:14 
40 0:84 0:28 0:15 —0:75 0:24 
4 —147 -0-63 0:55 0:38 074 
42 0:57 0:12 0:01 —1:01 0:07 
43 0-82 125 0:49 113 1:03 
44 0-79 0:24 0:12 —0:81 0-19 


(Continued) 
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“TABLE 2 (continued) 


I H HI IV Ix TI 
45 —0-92 —0:47 0:27 — 0:28 0-43 
46 1:12 0:81 0:54 0:36 0-91 
47 =121 — 0:68 0:62 0:55 0:82 
48 —0-90 —0:46 0-26 -0:32 0-42 
49 0:24 - 0:02 —0:07 -1:13 0:00 
50 2:96 —2:62 -4-61 0:03 - 7:76 
TABLE 3. ROTATION 

0-035 0-983 0-153 —0-091 

—0:334 —0-071 — 0-029 — 0:940 

0-234 —0-116 0-952 —0-100 

-0:912 0-021 0-261 0-314 


In a real case, the factor score matrix of Table 1 would have been obtained 
as the end result of an orthogonal factor analysis of m observed variables. To 
complete the estimation of the model parameters, the mx4 matrix of factor 
loadings obtained in the preliminary factor analysis would be operated upon, 
with the orthonormal matrix given in Table 3. Clearly, the interpretation is 


that the first, second and fourth orthogonal components are mutually statistically 
independent latent variables, whereas the third represents a simple interaction 
of the first two. 


5. GENERAL DISCUSSION 


A good deal of experience with empirical data will be necessary before it is 
possible to say whether the models described above will commonly prove to be 


theory, and a choice between model (1) and model (19) 
may rest on psychological con: 


made siderations, if, indeed, a rational choice can be 
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A PROCEDURE FOR DETERMINING SLATER’S i AND ALL 
NEAREST ADJOINING ORDERS 


By ГР. N. PuiLLIPS 
University of Hull 


A new procedure is described for determining Slater’s i and all nearest 
adjoining orders. The validity of the procedure is proved, and it is compared 
with Remage and Thompson’s algorithm for the same purpose. 


1. INTRODUCTION 


The classical measure of internal consistency for а complete table of pair- 
comparison preference judgements is the number of circular triads, Kendall's 
d; and the classical method of assigning ranks to the objects in an inconsistent 
table is to take those (sometimes tied) given by the row totals. Recently, how- 
ever, Slater (1960, 1961) has suggested as a measure of internal consistency the 
statistic 2, the size of the smallest set or sets of changes in the table which render 
it consistent, and as rankings for an inconsistent table the nearest adjoining 
orders, given by actually carrying out these sets of changes. 

Unfortunately the practical determination of і and the nearest adjoining 
orders is a matter of some difficulty. Slater (1961) quotes two rules due to 
Alway, but Remage and Thompson (1966) have shown, by a counter-example, 
that they are not valid. Тһе latter authors describe an elegant algorithm, 
whose validity they prove, but it is somewhat laborious to execute. This note 
presents a different procedure which has been incorporated in two computer 
programs reported elsewhere in this Journal (Phillips, 1967). 


2. Тне PROCEDURE 


Since the example of Kendall (1948) used by Remage and Thompson 
to demonstrate their algorithm is too simple to display fully the present 
procedure, a more complicated example will be used as an illustration (see 


Table 1 (а)). 

Step 1 Scan the pair-comparison table and list all circular triads (Table 1 (b)). 
To render the table consistent, it is necessary to break each triad by changing 
precisely one or two of the cells occurring in it. 

Step 2 List, for each cell in the pair-comparison table containing a plus sign, 
the number of circular triads in which it occurs and, for convenience, rearrange 
the list in decreasing order of these frequencies (Table 1 (c)): it can be shown 
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TABLE 1. 


(b (c) (d) А 
" i Frequency of Cumulative 
triads per cell frequency 


Pair comparisons Triads of triads 


Cell 7 
6,5 4 4 


j = = 
кə 
ЕЕ 
++ 


1,2 3 7 
7,1 10 
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i + + F S 
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I+ I +: 


1, 
1, 
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5, 
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2 
3, 
3 
4, 
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ка ка ка ма ма ма — кі 


5 
5, 
1 


that, both here and subsequently, cells not in triads may be ignored; the 
proof, although straightforward, is somewhat messy, and is accordingly omitted. 


We seek all smallest subsets of cells such that just one or two members of the 
subset occur in each triad. 


Step 3 Determine these smallest subsets by systematic trial and error. As 4 
preliminary, take cumulative totals of th 


e . ive t е ordered cell frequencies downwards 
until a total is reached which is equal to or greater than the number of circular 
triads (Table 1 (d)); the number of cells making up this total (here 4) gives ? 
minimum for 2. 


Then, for all trial values of i from t 


; his minimum upwards, until ¿ has been 
determined, enter the first branch of the following: 
Tree-search : 


1 For each feasible (i.e. not clearly impossible, as determined by 
appropriate cumulative totals of the current ranked cell frequency list) combina- 
tion of cells of the highest frequency, set up a new list of triads which would 
remain after all cells of the combination were changed. (The possibility that 
the changes may generate new circular triads is ignored at this stage.) For 


f 
EL. 
| 
| 
~ 
ә 
| 
| 
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5 

4 

3 
5 5 5 2 5 41 1 
4 2 3 7 4 15 5 
3 4 26 l 5 2 3 
2 5 4 3 34 2 
7 7 7 2 25 4 
6 6 6 7 7 z 7 
1 I 1 6 66 6 


Illustration of the tree-search. The two trees illustrate the search procedure 
on the data of Table 1 under the hypotheses that i =4 (falsified) and 
Each branch represents an investigation of the effect, on the current 
list of circular triads, of changing the cells written beside it (e.g., 1,2 7,1 means “ try 
changing cell 1,2 and not cell 7,1”), and terminates either in a choice point leading to other 
branches, or in a dead end (DE), or no feasible combinations (NFC) or a nearest adjoining 


order. 


FIGURE 1. 
(for details see text) 
that ¿= 5 (verified). 
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each such list there are three possibilities. First, it may be empty, in which 
case verify, by carrying out all changes under consideration, that they have not 
generated any new circular triads or changed all the cells of a circular triad 
(practical experience with both randomly generated and experimentally obtained 
data suggests that this undesired complication rarely occurs); if they have not, 
note that 2 has been determined (if this has not already been done) and write 
down the nearest adjoining order obtained; if they have, do nothing. Second, 
the new list of circular triads may be non-empty, and there may be one or more 
further changes still available, in which case set up a new ranked cell frequency 
list for the list of triads remaining (excluding all cells previously of the highest 
frequency) and enter a new branch of the tree-search with these new data. 
Third, the new list of circular triads may not be empty, and there may be no 
more changes available, in which case a dead end has been reached, so do nothing. 
When all the combinations have been considered, and appropriate action taken 
in each case, exit from the current branch of the tree-search. 


This procedure is illustrated in Figure 1 and Tables 2 to 7. 
TABLE 2 
(а) (b) 
Triads Cell £f 
i, 2 3 
4. d 3 


М кака ка ка кака кл 
PUUKWWNNN 
C 4 doo 


союне; 
= — зло 
ot t 


Фоо NO tn tA о № 
омюр о о 
н на ы н на н ы 


- Eu : bs ER value 4, it can be seen from Table 1 (c) that this could 
iue a ed 5 ce des Were changed. The list of triads then remaining is 
changes still avail s posed non-empty, and there are three further 
a ge E able, а new ranked cel] frequency list is set up (Table 2 (0)). 

€-entering the search with the data of Table 2, it is seen that the eight triads 
ree changes still available only if cells 
ts of doing so are shown in ‘Table 3, 


= 
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1 from which it is seen that there is по feasible cell of the highest frequency, so 


=“ it is necessary to exit from this branch, and then from all higher branches of 
| the search. 
| 
| TABLE 3 
(а) (6) 

| ~ Triads Cell 7 
| 1 36 1,3 1 

1 54 1, 8 1 
| 243 24 4 
| 4, 1 1 
| 6, 1 1 

3, 6 1 

| 524001 
| 3; 1 
| 43 4 


Then giving 2 the trial value 5, the first branch of the tree-search is re- 
+ entered. The successive stages will now be presented in abbreviated form. 
There are first of all two feasible combinations, changing and not changing 
6,5: each will be considered in turn. 


Stage 1 Change 6,5 (four changes still available). Result: Table 2; and 
| there are four feasible combinations, changing both, just one, or 
| neither of cells 1,2 and 7,1; consider each in turn. 


| Stage 1.1 Change 6,5: 1,2; 7,1 (two changes still available). Result: Table 
f 3; and no feasible combinations. 


Stage 1.2 Change 6,5: 1,2; not 7,1 (three changes still available). Result: 
Table 4; and one feasible combination, cells 1,3 and 1,5. 


" TABLE 4 
(а) (b) 
Triads Cell f 
136 1, 3 2 
13 7 1,.:5 2 
154 
157 2,4 1 
243 4, 1 1 
6, 1 1 
3, 6 1 
5, 4 1 
2 5.7 1 
3.2 1 
2,7 1 
4, 3 1 


Stage 1.2.1 Change 6,5: 1,2; not 7,1: 1,3; 1,5 (one change still available). This 
leaves the single triad 2,4,3 and the three cells which occur in it, 
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Stage 1.3 


Stage 1.3.1 


Stage 1.3.2 


Stage 1.3.3 
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each with frequency 1. Changing 2,4 breaks 2,4,3, so we try the 
whole set of changes 6,5; 1,2; 1,3; 1,5; 2,4: it is found to give the 
nearest adjoining order 5,4,3,2,7,6,1, so note that 7=5. Similarly, 
changing 3,2 gives the order 5,2,4,3,7,6,1 and changing 4,3 gives 
5,3,2,4,7,6,1. 


Change 6,5: not 1,2; 7,1 (three changes still available). Result: 
Table 5; and four feasible combinations, all three or just two, of 
cells 2,4, 4,1 and 6,1: consider each in turn. | 


'TaBLE 5 
(a) (b) 

Triads Cell f 
124 2 4 2 
12 6 4,1 2 
13 6 6; 1 2 
154 

2 4 3 1, 


WN с Сл > 
- ннн нн 


Change 6,5: not 1,2; 7,1: 2,4; 4,1; 6,1 (no more changes available). | 
This procedure breaks all remaining triads, so it is necessary to 


try the whole set of changes. It gives the nearest adjoining order 
1,5,4,3,2,7,6. 


Change 6,5: not 1,2; 7,1: 2,4; 4,1; not 6,1 (one change still avail- 
able). Result: Table 6 and no feasible combinations. | 


TABLE 6 
(a) (6) 
Triads Cell f 
12% 1 
136 2, : | ef 
2%; 6 1 


Change 6,5: not 1,2; 7,1: 24; 
able). This leaves the single t 
each with frequency 1. 

nearest adjoining orders 


not 4,1; 6,1 (one change still avail- 
gle triad 1,5,4 and the cells 1,5 and 5,44 
Changing each of them in turn gives the 
5,4,1,3,2,7,6 and 4,1,5,3,2,7,6. 


M „__ 
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Stage 1.3.4 Change 6,5: not 1,2; 7,1: not 2,4; 4,1; 6,1 (one change still avail- 
able). This leaves the single triad 2,4,3 and the cells 3,2 and 4,3, 
each with frequency 1. Changing each of them in turn gives the 
nearest adjoining orders 1,5,2,4,3,7,6 and 1,5,3,2,4,7,6. 


Stage 1.4 Change 6,5: not 1,2; not 7,1 (four changes still available). Result: 
Table 2 (a) and Table 2 (b) with the first two cells missing. Five 
feasible combinations, changing just four of the cells of frequency 2. 
Each leads to a dead end. 


Stage 2 Do not change 6,5 (five changes still available). Result: Table 
1(b) and Table 1(c) with the first cell missing. One feasible 
combination changing cells 1,2 and 7,1. 


Stage 2.1 Do not change 6,5: but change 1,2; 7,1 (three changes still avail- 
able). Result: Table 7 and no feasible combinations. 


TABLE 7 
(а (b) 
Triads Cell f 
1 3: 6 3, 6 2 
1 5 4 5, 4 2 
2 4 3 
265 1, 3 1 
3.6 5 1, 5 1 
4 6 5 2, 4 1 
5 7 6 2,6 1 
4, 1 1 
5, 7 1 
6, 1 1 
8,2 1 
4, 3 1 
4, 6 1 
5, 2 1 
5, 3 1 
7, 6 1 


This is the end of the tree-search, i=5, and all nearest adjoining orders 
have been determined. 


Short cuts There are a number of obvious short cuts which will occur to the 
reader during hand computations. For example, the essential result of Stage 
1.1 for і-5 can be seen without actually carrying it out: first, changing 12 
and 7,1 will break not six but five triads (since both changes break 1,2,7) leaving 
three ih that at least one of the two changes still available must be of a cell of 
frequency two; secondly, changing 1,2 will reduce the frequency of cells 
2,4, 4,1 and 6,1 from two to one (because of the triads 1,2,4 and 1,2,6) and 
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changing 7,1 will similarly reduce the frequency of 1,3 and 1,5, so that there will 
be no cells of frequency two left. Again, Stage 1.4 can be shortened if it is 
noticed that none of the cells of frequency two will break the triad 1,2,7. How- 
ever, such devices are dificult to systematize, and іп any case hand computa- 
tions are not really recommended. ` 


3. Discussion 


Validity The validity of the procedure is easily seen. Any subset of cells, 
changing which will render the pair comparison table consistent must have the 
property mentioned under Step 2, namely that just one or two members of the 
subset occur in each circular triad. Since the procedure in effect examines all 
subsets (by examining all feasible classes of them) from the smallest upwards, 
until all minimal ones with this property are found, which also actually render the 
data consistent, it will necessarily determine 2 and all nearest adjoining orders. 


Comparison with Remage and Thompson’s algorithm А fairly florid example was 
necessary fully to display the procedure, which is easier to carry out than to 
describe, so that the reader may doubt whether it is any more efficient than the 
399 computations needed with Remage and Thompson’s algorithm. 

In fact it does offer certain advantages. First, the length of the calculation 
for a fixed number of objects, m, appears to depend mainly upon i (a precise 
formulation of this relationship would probably be both very difficult to obtain 
and of little practical use) and for small i the procedure is very quick. The 
reader may verify for himself that the solution to Kendall's example (Table 8), 
where i=2, is evident at the end of step 2, or even step 1, after only a few 


TABLE 8 

1 2 3 4 5 6 
1. + + - + + 
2 = à — d + — 
E Ф + + + 
$ d Rl = 
5 — = — + + 
6 — $ =< # = ` 


minutes work, whereas Remage and Thompson’s algorithm needs 156 computa- 
tions. At the other extreme, a special pathological case, such as Table 9, where 
і-7, may be very lengthy (here each of the cells containing a plus sign occurs 
in just 2 of the 14 triads, so that the Procedure reduces to trying out all 
(7) =116,280 feasible combinations, which is much worse even than the 


7!=5,040 computations required for complete enumeration). However, 
second, the procedure almost immediately obtains a minimum value for 7, 50 
that in practical applications hopelessly inconsistent data may be rejected without 
further ado, whereas Remage and Thompson’s algorithm does not. Third, any 


len . Жа. 
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TaBLE 9 

1 2 5S 4 S 6 ж 
1 „ + + — j Ll n 
2 - + + — + = 
S = , # + — + 
Ж qoc © ¿ s* a — 
5 — + -.2 Ф + 
б c — ж в m V k 
T E p= domu 


practical use of either method requires it to be embodied in a computer program 
and Remage and Thompson's algorithm appears awkward for this, since for 
the determination of the nearest adjoining orders the results of m2”-1—m? 
(if matrix storage methods are used) or upwards of 253—1 —(1 —m3)/(1 —m) (if 
list processing is employed) computations must be simultaneously stored, 
which is rather extravagant of space. The present procedure can be con- 
veniently implemented as a recursive procedure: such an implementation in 
the Elliott 803 dialect of ALGOL 60, has been developed, and copies may be 


obtained from the writer. 
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THE EFFECT OF CRITERION SCORE GROUPING UPON 
ITEM PARAMETER ESTIMATION 


LI By FRANK B. BAKER 


University of Wisconsin 


Тһе differential effect of three grouping procedures upon the first two 
moments of the sampling distributions of the parameters Х and f of the item 
| characteristic curve were investigated empirically. The results of the study 
- indicated that differences іп the variances of B and Xo as well as in the bias of £, 
could be attributed to differences in grouping procedures. Item parameter 
estimates possessing the minimum bias and variance within the study were 
those yielded by the grouping procedure which used equally spaced criterion score 
| intervals and the same number of subjects within each interval. 


1. INTRODUCTION 


| Тһе assumption of a normal ogive relation between the probability of a 
| subject answering an item correctly and his ability level underlies a number of 
V° classical item analysis procedures. Graphical techniques for estimating the 
| parameters of location and scale of this ogive (the item characteristic curve) 
| have been presented by Terman and Merrill (1937) and Das (1964); and 
analytical estimation techniques were developed by Ferguson (1942), Lawley 
| (1943) апа Finney (1944). That the logical basis of these procedures is the same 
as those used to estimate the median lethal dose in quantal-response bio-assay 
was shown by Finney (1944). In addition, the constant process used by 
| Ferguson (1942) was independently derived іп the bio-assay context by Berkson 
| (1955) under the label of minimum normit chi-square. 

Application of these estimation procedures to psychological test data 
| usually involves grouping the criterion scores in some fashion. Typically, the 
Y criterion ability scale is divided into intervals of equal standard score width and 
the number of subjects falling in each interval is a function of the distribution 
| of the underlying ability and of sample size. Each subject within an interval 

possesses a unique ability level, but due to grouping is considered to have 
attained the mid-point score. Tocher (1949) has shown that the effect of this 
type of grouping upon the estimates of the parameters of the item characteristic 
curve can be compensated by applying the usual Sheppards’ corrections for 
4 grouping. А second grouping procedure, analogous to that followed in bio- 
: assay, is one in which the criterion ability levels of interest are specified a priori 
and an equal number of subjects at each ability level are used. This procedure 
is widely used in analytical and empirical studies of the statistical properties of 
estimates of normal ogive parameters (see Berkson, 1955; Cramer, 1964) and 
can be used in routine item analysis when sufficient numbers of observations 


228 Frank В. Baker 


ili ical of large-scale 
ссе а нан, җен» to nen 
i rammes ta 
va а A third procedure (see Das, 1964) would ps einen 

the ability distribution into regions of equal frequency and e ehe 
of the region as the attained ability score of each subject wit in the aam 
А limited comparison of the first and third of these grouping pro be 
was performed by Das (1964) who compared estimates of the item aps 
based upon a single data set having equal score intervals but an unequa wwe sa 
of subjects at each level with those estimates yielded by a second data set Ou the 
unequal score intervals but an equal number of subjects at each level. On : 
basis of these two data sets she concluded that the estimates of the ce 
of the item characteristic curve were “ to a considerable extent independent 0 
the basis for selecting class intervals, and hence, frequency, and range of ability 
within groups ”. 
Although the anal 


ytic work due to Tocher (1949) showed the effect of the 
typical item anal 


ysis grouping procedure upon item parameter estimates, ш 
literature did not reveal a systematic investigation of the differential effect i 
various grouping procedures upon these estimates. Therefore, the presen 
paper extends the previous work of Das (1964) and attempts to determine, 
through Monte Carlo procedures, if a differential effect exists. 


2. PROCEDURES 
Тһе three grouping techniques employed were as follows: 
(a) Equally spaced scores alon 
and an equal number of su 
at each level—a procedure 
(b) Тһе ability scale was divid 


g the criterion ability scale were selected 
bjects having the specified score were use 
analogous to bio-assay practice. 

ed into equal standard score intervals and 
the frequency distribution of subjects was approximately normat. 


Тһе mid-point score was used for all subjects in an interval—the 
typical item analysis procedure. 


() The fre 
divided 


The resulting centroid 
ic, evenly spaced about t 
mean—the Das procedure, 


_ The numbers of grouping intervals investigated were seven and fifteen 
which were considered 


l item analysis practice. oo 
were employed. The 42. 
7 and 15 intervals respective 


grouping intervals 
0-75c and 0:3750, for 


val widths Which divi 
or 15 equal area intervals, Within each group 


intervals, three sample sizes were employed, The sample sizes were chosen 19 
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cover the range from approximately a single classroom to the point where large 
sample theory approximations hold for the variances of estimates of the item 
parameters (see Baker, 1962). These were 42, 84 and 168 for the seven-interval 
case; and 45, 90 and 180 for the fifteen-interval case. It was not possible to 
keep the sample size both small and equal across the two-interval specifications, 
hence the nearest possible sample sizes were used. When equal numbers of 
observations per grouping interval were employed, the number of observations 
was a multiple of 6 for the seven-interval case and of 3 for the fifteen-interval 
data. The observations in the unequal frequency data were roughly normally 
distributed over the 5-250 range of the grouping intervals. 

Тһе parameters of the item characteristic curve used were those adapted 
from bio-assay by Baker (1961) where: 

Ху =the mean of the normal ogive, a measure of where on the ability 


scale the item functions; 
B —the reciprocal of the standard deviation of the ogive, a measure of 


the item's power of discrimination. 


In order to enhance the generality of the empirical results, nine sets of item 
parameter values, representing roughly high, medium, and low values of each 
parameter were employed. 

Тһе basic technique of the present paper was a Monte Carlo procedure in 
which the parameters of the item characteristic curve were used to determine 
the true proportions of correct response P; at the specified ability levels. Given 
the true proportion of correct response P; and the number of subjects, Nj, the 
observed number of correct responses, r;, at that ability level was generated. 
Тһе generated ғу were random and followed a binomial distribution having 
parameters P; and М). The observed proportion of correct response Фу= "10 
was generated for each ability level and provided the data upon which estimates 
of the item parameters Хоу and В were based. The minimum normit chi- 
square estimation procedure (Berkson, 1955) was used because it was non- 
iterative and therefore minimized the amount of expensive computer time. 
The generation-estimation procedure was replicated 100 times for each data set 


and item and parameter combination. 


3. RESULTS 


Seven Intervals 

The empirical results based upon the seven-interval data are presented in 
Table 1 and the results it contains pertaining to Хуу and f are discussed separ- 
For the specification B=0-10 and over all levels of Ху the three group- 
ampling distributions of B having approximately the 
ude of the standard deviations of the sampling distri- 
For a sample size 
equal interval 


ately. 
ing procedures yielded s 


same mean. The magnit ( 
butions of д readily orders the three grouping procedures. 
of 168 the order was: equal interval and equal frequency (0-06), 
and unequal frequency (0-09) and unequal interval and equal frequency (0-12), 


TABL И " 
E 1. Summary STATISTICS OF SAMPLING DISTRIBUTIONS OF ITEM PARAMETER ESTIMATES BASED UPON Seven ABILITY LEVELS 


oez 


Teak Equal Interval Unequal Interval 
Su 
Parameter үөн з Equal Frequency Unequal Frequency Equal Frequency 
Хо B N=42 М-84 М-168 | N=42 М-84  N-168 | N=42 N=84 N=168 
0-10 B 0:11 0-10 0-09 0:07 0:09 0-11 0:09 0:08 0-09 
E 0:11 0:09 0:07 0-14 0:13 0:09 0-20 0:17 0:12 
-1:5 so — 2:85 -1-40 —2:81 -223 0:90 -1:32 ж -0:59 0-25 
St, 15:52 7:24 5:56 19:40 25:30 4-52 ж 5:18 14-11 
040 B 0-35 0-37 0-39 0:25 0:32 0:36 0:36 0:38 0-40 
58 0-11 0-09 0-07 0-11 0-12 0-10 0-23 0-19 0-13 m 
k X;o -1:60 -1-58 -1:54 —3:00 -1:86 -1:77 —2:53 -220 -17 8 
leo 1:14 0:56 0-42 320 2:21 0-75 22-02 4-57 130 P 
ie еі w 
0:70 8 0-4 0-55 0-63 0-34 0-45 0-55 0-47 0:58 0-64 tg 
58 0-10 0-08 0-07 0-10 0-11 0-10 0-19 0:19 0:14 = 
= Хо = 1:62 = 1:57 = 1:50 -2:59 -2:07 -1:78 -2:30 —2:29 —1:64 Ft 
5% 0:69 0-37 0:23 1:57 0:89 0:39 1:62 431 0-43 
0:10 B 0-11 0-10 0-10 0:07 0:12 0:09 0-12 0-12 0-09 
58 0-13 0-10 0-06 0-13 0-13 0-09 0-23 0:17 0-12 
0-0 X50 845 0-36 —0:77 * —1:93 —2:03 —0:53 4:89 —1:56 
Sis 73-41 5:99 5-02 ы 22-87 13:56 14-21 44-95 10-50 
0-40 B 0-37 0:40 0-41 0-30 0:34 0:37 0-41 0:38 0-38 
Sá 0-11 0-11 0-08 0-13 0-12 0-09 0-22 0-19 0-13 
0-0 Жы —0:02 0:09 —0:05 —0-04 —0:06 0-04 0-21 0-07 —0-05 
Sg 0-56 0:44 0:26 0-99 0:53 0-25 1-30 1:20 0-33 
a 
= — —" ~~ = = == 


hy щш 


ja Б A £ 
: > “ қ " w 
TABLE 1 (Continued) 
Equal Interval Unequal Interval 
Пет Summary 
Parameter Statistics Equal Frequency Unequal Frequency Equal Frequency 
Жы В N=42 N=84 М-16 | N-42 N=84 М-16 | N-42  N-8& N=168 
0-70 B 0-57 0-66 0-69 0-43 0:57 0:64 0-63 0-72 0:70 
58 0:09 0-11 0-08 0-11 0-11 0-09 0-22 0-18 0-12 
0-0 X50 0-02 —0:00 -0%01 —0-02 — 0:04 0:02 0:02 0:01 0-01 
St 0:29 024 0:19 0:39 0-26 0-16 0:37 0:21 0-15 
0-10 B 0-11 0-09 0-10 0-08 0-11 0:10 0:10 0-11 0-10 
58 0-13 0-09 0-06 0-13 0-12 0-10 0-26 0-17 0-11 
+15 Ж 0-62 —7:52 0-43 * 35-34 7:48 3-29 -0:69 1:99 
Sg. 472 91:71 9-99 ж 582-12 54-35 30-28 5:47 10-98 
0:40 B 0-33 0:37 0:39 0-25 0:33 0:36 0-34 0:36 0-41 
Sá 0-11 0-09 0-07 0-12 0-13 0-11 0-21 0-19 0:14 
+15 Ху 1:82 1:60 1:54 2-45 2-17 1:78 2:01 2:73 174 
Ss 145 0:48 0:38 5-46 1:58 0-68 6-12 5:79 0:81 
0-70 B 0-44 0-55 0-61 0-31 0-47 0:56 0-46 0-61 0-65 
Sá 0-09 0-09 0-07 0:11 0:10 0-10 0-18 0-18 0-15 
+15 Ха 1:65 1:59 1:52 3-00 1:98 1:79 2:89 2:04 1-63 
Se, 0-60 0-40 0-24 1-79 0-68 0:39 3-93 2:86 0:39 


* Excessively large values not reported. 
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It should be noted that the standard deviation of the last procedure is twice 
that of the first. The sampling distributions of Ё for B=0:40 had a minimum 
bias under the equal interval and equal frequency procedure (—0-07 to 0-01) 
and a maximum bias under the equal interval and unequal frequency procedure 
(—0-15 to — 0-01). Increasing sample size from N —42 to N 5168 reduced the 
bias to very small quantities (—0-02 to -- 0-02) under all three grouping pro- 
cedures. The minimum standard deviation of Ё, for N= 168, was achieved 


е population value of B=0-70. The 


‹ сіргоса ое Sl Ne ы ‚ the sampling 
so Were influenced by the value of 8. When у а value of 
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ification of X, = 1-50 and 
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upper and lower portio: enh the 


; its 

Same normal ogive from 1 
Statistics at these two э иреше us Observed similarity of the sampling 
> e 

were employed. When B=0-49 m at adequate Monte Carlo procedur 


Ç . » Ше populati = —1:50 
and + оры were closely estimated by the titan уй of aed pm equal 
interval and equal Берл and the unequal Several aed equal frequency 
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equal interval and unequal frequency procedure yielded mean values of X,, 
which had a bias of at least one-half a standard deviation unit. This bias is 
sufficiently large to affect one's interpretation of an item. The variability of 
the Ж, yielded by this procedure was similar to that observed in the unequal 
interval and equal frequency results. 

At the middle level, X;,—0-0, with B at 0-40 or 0-70, the mean of the 
sampling distribution of X; was a very close estimate (0-01 to 0-09) of X;, for 
all three grouping procedures. In addition, the standard deviations were roughly 
the same for all three grouping procedures. 

The estimates of X; yielded over all data sets by the equal interval and 
equal frequency grouping procedure possessed the least bias and the minimum 
variability of the three grouping procedures. It should be noted, however, 
that when 8 —0-10, the sampling properties of Х,, were consistently undesir- 
able over all grouping procedures and data sets. 


Fifteen Interoals 

Тһе results based upon the data for 15 intervals are presented in Table 2 
and the relationships among the grouping procedures are similar to those 
observed in Table 1. Several characteristics of the data appear to be related 
to the increased number of intervals. The mean value of (А - 180) tended 
to underestimate the value of В = 0:70 to a slightly greater degree than was 
evidenced in the seven-interval data. The estimates had a mean of approxi- 
mately 0:55 in the fifteen-interval results and of about 0-63 in the seven-interval 
results. The most discernible effect of increased number of grouping intervals 
upon the sampling properties of Х was in the reduction in the occurrence of 
unrealistically large values of Х,; when @=0-10. Tn Table 1 there were three 
cells in which no ZX; values were reported and in Table 2 there were none. 
The final effect attributable to increased number of intervals can be seen in 
the relationship of sample size to the summary statistics of B. When 7 intervals 
were used, both the bias and variability of B decreased rapidly as sample size 
increased. However, when 15 intervals were employed the bias and variability 
of Ё changed very little as sample size increased. The initial values of both 
the bias and variability were less in the fifteen-interval results than in the 


seven-interval data. 


4. DISCUSSION 


ction of both the grouping procedures and the normal equations 


Inspe саш ! 
: e estimation technique reveals a number 


solved by the minimum normit chi-squar num 
of fotom of importance in the present study. Fundamentally, the estimation 


technique is a weighted least-squares procedure which fits a linear P = 
line to the normal deviate transformations of the observed proportions of correc 


TABLE 2. Summary STATISTICS OF SAMPLING DISTRIBUTION OF THE PARAMETER ESTIMATES BASED UPON FIFTEEN ABILITY LEVELS m 
+ 
Equal Interval Unequal Interval 
Summary Equal Frequency Unequal Frequency Equal Frequency 
Statistics 
N=45 N=90 N=180 N=45 N=90 N=180 N=45 М-90 N=180 
B 0-08 0-09 0-09 0-03 0:06 0-08 0-07 0-09 011 
58 0:09 0-08 0:06 0-08 0:09 0:08 0:16 0-12 0-09 
Zo -258 0-74 — 2:72 –2:71 - 1:80 0-42 — 0:88 -0:37 +148 
Se, 2818 16:71 10:72 17:36 22:16 1124 10:14 9:84 30:17 
l ———— 
В 0:24 0-33 0:37 0-11 0-20 0-30 0-23 0-35 0-37 
Sá 0:07 0-08 0-06 0-07 0:08 0:08 0:13 0:14 0:10 Ej 
Ху -1:86 —1:65 -1:55 -434 —2:33 —1:98 -0:55 -2:54 -1:69 Ë 
5% 1:65 0-84 0:32 26:81 7:60 0:84 10:76 6:35 0:72 a 
— ell C š 
B 0:30 0-44 0:52 0:16 0:27 041 0:31 0-49 0:56 I: 
Sá 0:07 0-05 0-05 0:05 0:08 0:07 0:12 0:12 0:10 = 
Хо -1:75 «<1%53 -1:52 -3:76 — 3:31 — 2-14 — 2:48 – 1:76 — 1:67 m 
Sto 0:80 0:36 0:25 2:23 3:15 0:46 2:28 0:77 0:34 
————————— 1 
8 0-07 0-10 0-1 0-03 0:07 0:08 0-06 0-10 0-10 
Sg 0-08 0:07 0:06 0-08 0-10 0-09 0-15 0-12 0:10 
Ху — 4:28 --0:15 0:15 — 0-60 1:58 -0:78 -0:39 — 0:22 —0:37 
S£, 38:20 4:28 1:71 13°77 25:80 8:04 3:09 3:56 2:42 
8 0-28 0-35 0:39 0:15 0-22 0:32 0:31 0:38 0-39 
58 0:08 0-08 0:06 0:07 0-09 0:08 0:14 0:14 0:12 
Xa -0:02 0-00 — 0:03 —0:23 — 0:27 — 0:01 — 0:34 0:05 0:01 
0:51 0:37 0:26 2:69 2-04 0-34 2:38 0-48 0:27 


TABLE 2 (Continued) 


Item 
Parameter 
Хә В 

0-70 
0-0 
0-10 
+15 
0-40 
+15 
0-70 
+155 


Summary 
Statistics 


Equal Frequency 


Equal Interval 


Unequal Frequency 


Unequal Interval 
=== = == == салы саз 
Equal Frequency 


N=90 


N=180 


N=45 


N=90 


N=180 


0-40 0:54 0-63 0:23 0:37 0:52 
0-05 0-06 0:04 0:07 0-08 0:07 
0-01 0-03 — 0:00 —0-66 0-02 -0:02 
0-33 0:21 0-16 6:84 0:33 0:16 


N-45  N-90 М-180 
a ANN ee NM 
0:46 0-62 0:64 
0-11 0-13 0-1 
003-003 0-02 
0-32 0:24 014 


0-05 0:07 0:07 0-08 0:09 0:08 
0-09 0:09 0:09 0:15 0:13 0-11 
0-83 1:39 312 —1:12 0:43 1:61 
4-05 32:02 27:52 11:26 14:10 8-42 
ee 
0-12 0:20 0:29 0:26 0:34 0:38 
0-06 0:08 0:08 0:14 0:15 0:10 
3:16 3:53 2:03 2:53 1:84 1:61 
11-24 4:04 0:69 3:66 1:70 0:63 
a mE J eee 
0-26 0-16 0:42 0:33 0:46 0:58 
0:07 0:06 0:07 0-13 0-11 0:10 
3-45 477 243 2:08 1:90 1:66 
2-54 6:04 046 3:05 0-76 0:33 
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The role of ‘ step-size’ also makes itself apparent іп the effect upon the 
means and variances of the item parameter estimates due to increasing sample 
size and to increasing the number of grouping intervals. When 7 intervals were 
employed, the bias of the mean of f and the variability of B decreased markedly 
when the number of observations per ability level increased from 6 to 24. Such 
a pronounced decrease was not observed in the fifteen-interval results where the 
summary statistics for B changed little when the number of observations per 
ability level increased from 3 to 12. When unequal frequencies were used with 

. data based on 15 intervals, the ‘ step-size’ problem was aggravated, in that a 
. large proportion (6/15, 8/15, and 8/15) of the ability levels had numbers of 
observations less than that employed under the equal frequency procedures. 
Consequently, the mean of B tended to underestimate В to a greater degree than 
‘that yielded by the equal interval and equal frequency procedure. This pattern 
of underestimation was also observed in the seven-interval data, but was more 


pronounced in the fifteen-interval data. 
The empirical results cited above and their elaboration indicate that the 


‘ best? grouping procedure over all combinations of sample size, number of 
grouping intervals and item parameter values was the equal interval and equal 
frequency procedure. The next ‘best’ was the equal interval and unequal 
frequency procedure which is analogous to typical item analysis practice. The 
sampling properties of the item parameter estimates yielded by the unequal 
interval and equal frequency grouping procedure suggested by Das (1964) were 
not as acceptable as those of either equal interval procedure. This procedure 
consistently yielded item parameter estimates having greater variability than 


either of the other grouping procedures. ; Р А 
The results of the present study suggest that discernible differences іп the 


sampling properties of item parameter estimates can be attributed to the group- 
ing procedure, i.e., the number and spacing of the ability scores used, as well 
as the number and distribution of the observations at each ability level. Opera- 
tionally, the bias observed in the mean of f or Хр as a function of the grouping 
procedure was not great enough at the largest sample sizes to be of concern, with 
the exception of the bias of £s under the equal interval and unequal frequency 
procedure. However, the variabilities of the estimates of В and Z, yielded by 
the unequal interval and equal frequency procedure were, in many cases, nearly 
twice as large as those yielded by the equal interval and equal frequency procedure. 

The differences existing between grouping procedures were quite pro- 
nounced at the smallest sample size and, as would be expected, least pronounced 
at the largest sample size. Increasing sample size had the most effect within 
the seven-interval data, which suggests that the number of observations at each 
ability level is an important consideration. The overall similarity of the seven- 
and fifteen-interval data suggests that increasing the number of intervals and 
application of the 1/2N rule tends to compensate for the problems associated 


with large * step-size’. 
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The FORTRAN 63 computer program used to generate and analyse the 
data of the present study is available from the author upon request. 
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NOTES AND CORRESPONDENCE 


A NOTE ON A GENERATING ASSUMPTION IN McGEE’S 
MULTIDIMENSIONAL ANALYSIS OF ‘ELASTIC’ DISTANCES 


By В. A. M. Grecson апа P. N. RUSSELL 
University of Canterbury, New Zealand 


Evidence is offered that multidimensional similarity judgements do not 
behave in a way demanded for an elastic distance model advanced by McGee. 
Problems concerning the properties of similarity data which can be included in 
Scaling models are indicated. 


In a multidimensional scaling model differing in some important particulars from 
those already published, McGee (1966, p. 182) advances a psychophysical rationale for 
introducing a tolerance when mapping perceived similarities into a multidimensional space. 
He asserts that * Psychological judgements which indicate relatively great separation of 
stimuli should be allowed greater error than judgements indicating close proximity. This 
conception is well documented in the copious literature on psychophysics and refers to the 
organism.’ He again emphasizes the central role of this generat- 
ing assumption in his model (p. 189) where he states the criterion . . . is explicitly rooted 
ina ... time-honoured tradition that large disparities are less readily resolved than small 
disparities’. Whilst his method shows relatively little difference from others with which 
it is compared, he concludes “а case can be made in favour of requiring a multidimensional 
scaling procedure to yield information concerning judgement variability i | 

The problem here is that the psychophysical precedents which McGee uses as жола 
matic may not be strictly relevant to the multidimensional situation. The x 
Psychophysical situation, where judgements of stimulus intensity or stimulus magnitude 
are elicited, leads, in some cases, to a relation of the form 

ass) m Rb, (D 


where o;(x) is the variability of the physical correlate of stimuli j 
form of equation with у (a magnitude response) for x was shown by Eis 5 
be a Weber function and may be the basis of a general psychophysical differentia 
with k corresponding to the Weber fraction. . ^t ° 
When, алық we turn (о comparison judgements in а multivariate аш ene 
and propose to use these as data for a multidimensional scaling, the pale и with у as 
arises as to whether or not such judgements do, in fact, fit eqn. (1) with x 
the variable, 
From records of recent gustatory Psy 
(Gregson, 1965, 1966 a, b, c; Gregson and R 
ave re-examined relations between mean ó; (M 
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oh multidimensional scaling we have attempted. we "t 
Omparisons for б), 8з, and б, as it is in general true tha 
ый. ЖЕЛ) A 
8; 8 and 64%0, If a relation 


ically meaningful. 
sci iie wm 4 is to be expected that the 


“resolving power ” of the 


udged equal to x. This 
у Eisler (1963, 1965) to 
] equation, 


ments we have conducted 


uL e n Gregson, 1966), we 


1965; Russell and I 
a ised notation here; but with бу based 


"e these inequalities are regular, reliable and 
the form of eqn. (1) obtains between бу and var dij 


р АШ "d. еи — И 


Notes and Correspondence 


240 


CIqndun) рәѕѕпұ 
CIqndun) Пәввпу 
(CIqndun) I[[əssnzI 
('Iqndun) [jassnyy 
(© 9961) човдәгеу 
(о 9961) човдәг) 
(о 9961) uossoi5 
(о 9961) чоѕ8әле) 
(q 9961) чоз8әлсу 
(CIqndun) [jessnyy 
(9961) uosdo15 
pue [[ә55пуү[ 
(v 9961) чоѕ8әлг) 
(S961) чоѕ8әлғу 


924n0$* 


? 


“ва зәр pejeor[dor A[ojo[duroou] y 


1800- (OL хор)? 
М00- (OL хор? 
552-0 — (01 хорї 
%0-0- (OL хор? 
80:0 — 8х6 
66:0 — 8х6 
6600- 8х6 
996-0 — 8х6 
840-0 — 01 хот 
EILO OL хот 
987-0 + 8х8 
с<0- Or XOI 
ға-0- 8х8 
$ә2ию14ю2 рир axis 
$ирәш 1122 хыр 
иәә?2]ә7 4 


91 
91 
8I 
LI 
9I 
9I 
9I 
9I 


Oct ++ ++ ++ ++ ye 


asez zou [+£ 0] 


*96 


+ 


x09 
+ [41 
£ OL 


сә 


$иоззиәш1р М 


snynus 


fo ON 


AN XN 
AN XN 
AN XW 
AA XA 
Их ИГ 
NX ИГ 
Их W 
Их W 
Их W 
NX W 


Их И 
Их W 
M X W 
adh quausadxa 
әѕиофѕәуг fo 2d], 


9U4 «060094 


ѕімяипняахя ONIIVOS 'IVNOISNSIXIGLLIQJAD £] NOJ SHONVINVA MNIJHL ANV 
Sulvq SNIOWILG XHTdINOO AO SHLLDHV'IINISG аялізояяд NIIMLAG SNOLLVTAY СІ S'IHV], 


kaz ааа Е 


Yag 


asuata [ә 


Notes and Correspondence 241 


correlation (б, var 8j; : 
Positive [iis r oes prse piede ы. un m dd жеш Бе қалала 
ties а: EU та component of regression and eqn. (1) is linear. 
intermodal (Mx M^). 3 үй ^ жесі were involved, intramodal (Mx М or M'x M^), and 
eels ence gd li case either taste mixtures, М, were compared with one 
M ODE неа 5 М”, were so compared. In the intermodal case, taste mixtures 
type o£ der dels leoni pictures of taste mixtures. "Table 1 above sets out: the 
emu d s — Mn ср type, Су or Ajj; the number of subjects involved, N; 
8; muri ce. i — introduced; the matrix size; and the correlation between 
ik nooled = ЖР y is taken as the mean Likert scale value of a difference or matching 
es Heber a лыс cn is important, as replicated scalings on a single subject 
G ee ao n though in the case of gustatory psychophysics we strongly doubt 
Б = ерт. у supported doubt has been a reason for the use of data pooled 
loui rh i e original study is also given where relevant data have been published, 
ne these correlations may not have been reported. 
urther examination of the plots of 8;;/var ду indicated that some of these correla- 


tions ar ; š Е 
Еа = numerically small, partly because the relationship in question is curvilinear; in 
ses the largest variances are associated with intermediate similarity magnitudes and 
with stimulus mixture pairs which are 


pem variances with large or small 8; that is, 
INTE je per as ue similar or very dissimilar. | 5о, оп the available evidence, there 
im өнері vo types о experiment where McGee’s axiom, and hence his model, are 
ppings of behaviour. 
ОН, шүү light may be cast on this problem by consideration of cases (f) and (h) as 
aulis а) and (i) in Table 1. 'The larger negative values are associated with a task in which 
jects were paced to perform relatively quickly, whereas the smaller values are associated 


wit] i В : м 
h precisely the same stimulus task performed more slowly. However, in cases (j) and 
onse, whilst for (/) and (m) 


M bad. e one in which 10 sec were allowed for a resp! r 
compariso aS Һай to be performed in 2 sec. Also, in cases (k) and (m), anchor stimulus 
formula is illustrating large and small distances, as predicted from Kruskal’s (1964) 
that sibi төре provided for subjects at the start of the task. It seems pertinent to observe 
ао Bo similarity is а metric that does not necessarily behave like distance or 
identical e, for similarity is closed, bounded at both ends, where either stimulus pairs are 
maeni or they are as dissimilar as they can be. This is not an analogue of stimulus 
gnitude which is bounded by a lower and an upper threshold unless stimuli used in a 
Particular experiment cover a much wider range than is usually practicable. | 
йышт _complications arise because: (i) identity is single-valued but maximum 
intern pro is multi-valued; (ii) with complex stimuli their interdimensional variance or 
Mite ораса affects the reliability with which they are identified, which in S 
quate 25 Variance of the perception of à;j in which they are involved; (ii) there M inade- 
шу; evidence on the relation between the response variables similarity and a poem 
d dn similarity response, Ajj, а dissimilarity response and ду the ace m ue 
масу are intended to estimate, presumably Aj=k бір but it is on өле. Gn 
might d. 2d <> Ka JB) orit yu Ae ы what деле ед dissimilarity is 
linearly la npublished data collected by Russe suggest t ied nest opu 
edi elated to the degree of subjective confidence of a difference, Cip ? ique 
Simi ave scored Су <> k’§;;, but that similarity is not necessarily linearly related to 
ilarity, J? 
the „саев with McGee when he concludes 
of the pro multidimensional scaling procedures ' ап 
similaritieg — of the judgements and scales require 
сСее’ S сац WE properly decide what the appro’ 
field: a 5 axiom was as good as many others that have beet ediby wor бе 
data h Dosteriori his model seems to be restricted to experimental situations in she 
d. As no such restriction 18 — y 


QI 


s to be the teacher in 
y a closer examination 
for multidimensional 


might be. A priori 
rkers in this 


* experience appear 
d think that only b 
d to generate data 


priate axioms 
een advanced by wo 


ау: H " 
€ properties which we have not encountere 
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Kruskal’s (1964) methods, then if we are not prepared to commit ourselves to the sire 
plausible detailed structure of our distance measure on the grounds of evidence d 
in closely pertinent psychophysical situations, it would seem best to use non-metric rather 


than elastic spaces. If McGee is prepared to try different axioms to eqn. (1) his model 
could still be elastic and have more general validity. 
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A REPLY TO SOME CRIT 
ICISMS OF EL. 
MULTIDIMENSIONAL SCALING aie 


By Уістов E. МсСЕЕ 
Dartmouth College, Hanover, New Hampshire 


Gregs Ы ; 

bim су P c have raised an important issue in connection with the error 
"p ұйы d M ata. In McGee (1965, 1966 a, b) the discrepancy between di 
proportional to dy (or e nica distributed with zero mean and standard dus 
variable) This T dh epending on which of dij or d; one wishes to call the random 
hold for B a а еггог function is a direct outcome of requiring Weber’s law to 
Haenel] mir козе cni a and so we shall call it the Weber function. Gregson and 
taste ашаа be t е validity of this error function in the context of experiments with 

and their criticism is well taken. Since a number of other investigators have 


given mi ева i ; 

асай ea benefit of their reactions to * elastic ' multidimensional scaling (called EMD 
n now on) and since they point to the need for clarification of this same issue and 
First, the form of Gregson and 


poe те others, this reply will be organized as follows. 
cee ere will be examined with a view to showing that, although the shape of 
the Weber nerion needs to be examined empirically (as they say), their reasons for rejecting 
есімінен —_ are not compelling. Second, it has been argued that making Ау) 
laws iti nemom springs’ is completely arbitrary’. куШ be shown that if Weber’s 
Third, makin to hold for dissimilarity judgements then Aid; c is a necessary consequence. 
таса тк. inii of some valuable advice from Carroll it will be shown that the EMD 
function desi e made quite general. In particular, it can be made to handle any error 

esired. And fourth, the matter of the chi-square decision rule will be clarified. 


(а) А Reply to Gregson and Russell 


"Тһе: А 5 А 
se two investigators point to ‘ the purely empirical q 


not such [dissimilarity] ; T 2 
[dissimilarity] judgements do, in fact, бе some specified error function. This is 
r of experiments using taste mixtures and 


a : 
erus. 'They go on to report a number of exp 4 g taste : 
and the small me largest variances are associated with intermediate similarity magnitudes 
est variances with large or small б)”. Their statements аге somewhat too 
the Weber function as used in EMD 


emphatic ; ria 
Md s des my opinion. In attempting to verify 4 
, it was found to be well-nigh impossible to analyse categorical data in a reasonable 
35. The trouble is that categorical data represent 
um. It is not 


ma: 
a rp i a purpose of estimating varÓij. I i 
Possible to к information assumed to exist on some underlying continu 
where 8;; i ecapture the information lost in the process of categorizing. Hence var б, 
looking i бр кеп rating for a category, is not a satisfactory estimate of what we are 
metic with di З Kruskal says of Shepard’s crite: * we shall find ourselves doing arith- 
Oormation issimilarities ^. As one reviewer һа: * once the best monotone trans- 
much ab, enters into the picture, neither index ( uskal's stress) tells one 
made? € the variances of judgements on the c the judgements шеге 
categorical is difficult to see how any useful statem > m bout anche on i 
Variance di scale whose metric is unknown. (Parenthetically, it is easy to show t à equal 
аз larger istributions on a continuum between zero and some finite upper limit show up 
variances in intermediate categories and smaller variances at the lower and upper 


ends wh 
еп the continuum is chopped into equal-interval categories.) $ | 

and :#0, I have found in work with 

at good judges (i.e. 


. With 
distorteg eee to the inequalities 8,7 yi 
those “зш, samples (which must be presented successively) th good J 

Q2 Se correlation between two complete sets of 105 dissimilarity ieee 


uestion . . . as to whether or 


rion, 
s put it, 
McGee’s W or Kr 
ontinuum 07 which 
ent can be made а 


244 Notes and Correspondence 


over the same pairs was significantly higher than 0-5) do not show the x bones 
consistently. Worse judges do consistently show order effects, strongly in ra sides 
second speech sample. In the case of the second inequality, since we do 5 о жет с 
judgement, it is necessary to determine what non-zero scale value represents the se етті 
In sum, I agree that empirical work on judgement inequalities and judgeme eiat 
ability needs to be done. Such work fits the motivation for developing the PM TOR vm 
namely, that multidimensional scaling procedures themselves should yield in ее 
concerning judgement variability’. If and when an experimenter obtains inform: 


about his error function, then the general EMD procedure to be explained in (c) below 
may be applied. 


(6) Given the Weber Function, À;;d;;= с? is true 
Let us assume that Weber's law holds in 
following proof (due to Joseph W. Serene) show: 


The force required to distort the spring, dij, to a new length, dij, is given by 


Fij= kij(dj— dij). 


NAN r he 
the case of dissimilarity data. ‘Then s 
s that Aid; = c? is a necessary consequence: 


The work needed to effect this distortion is found by integrating with respect to distance: | 
4; ^ 
Bis 1) 
wij= j Еу- frats- Фу= 3 (dj— dijy. í 
dis 2 


The work criterion then has the form: 


1575 (d;— dz. 
In terms of eqn. (1) in McGee (1966 a) ki; and Aij are related by: | 
j=kidy]2. | 


Тһе в depend upon the stiffne: 


Ss of the sprin 5, whic 
for error іп the data. E 


e 
jh à ; h in turn depends upon the per #7 
Ud Ssuming Weber's law to hold, the ki’s should be chosen s 
%-4) _ (а-а, e 
ў dmn à à; 
then | 
Uij = Wyn 


Ay - ny — dnn)? 
kmn (а;- ауу ` 
Using eqn. (2), this can Бе rewritte 


n: 
which is satisfied if Fn i 
ky = Саа, (3) 


` 
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The work criterion then becomes 
C /d;— da? 
v-z. : 


Since the magni 
gnitude of the constant igni 
to eqn. (4) in McGee (1966) en" no significance as yet, сап. (4) reduces immediately 


W-Y (bg 
a 3 


where pij= (dij— 4/4). 
Th i i 
“КЖ... тене establishes the necessity of the relationship Ауу = c? if Weber's law 
me тты > Т . Without wanting to appear fixated, it should be noted that several 
eel a natural empathy for this assumption even in connection with similarity 


data: for example, Jones (1959, 1967). 
(c) eme the EMD Procedure 
n a particularly fruitful series of letters starting in 1964, J. D. Са i 
larl , J. D. Carroll has contributed 
much to my thinking about EMD methodology. The following generalization is "as 


largely to him. 
Consider a set of dissimilarities which are monotonically transformed into distances, 


(dj). We shall call these the observed distances. Then the EMD model can be stated 


formally as follows: 
dij= d ei, (5) 
and ej; is an error term which is assumed 


wi si А A А 
һеге dj is the ‘ true ' distance between : and j, 
We further allow that this error 


eal distributed with mean zero and variance oij*. 
rm is functionally related to the true distances: 
oy —f (dij), (6 


where f i : ^ ы š ME 
$; f is any function, monotonic or non-monotonic. The work function to be minimized 


is: 
Wad 56-405 о 
із 
2 | cy=Cloy? (С> 0). (8) 
ombining eqns. (6), (7) and (8), we obtain: 
(dij dij)? (9) 


W=C 
2 2 қар 


6)), this work functi 


he degrees of freed 
d as the ratio of erro 


choice of the error 


on is immediately distributed 
lom problem is treated in the 


Giv Р 
еп the prior assumptions (eqns. (5) and ( 
г variance to true 


Tr ec degrees of freedom. (T 
variance. ` e constant C can be interprete 
Nos PE Doe shown above has the advan ago si et of correcting an 
error in the 1965 FORTRAN йы, >: ре A mit ibt error function it 

program, it should be caci AMALG in the 


18 necessar Б s e 

y to determine a specific algorithm (subroutines А d AMALC 

RTRAN program) for finding the best monotone transforms of the dissimilarities. Tte 
966 a) was inappropriate for the 


Use of the K 
ruskal’ : Е \ 
армаган > а program (Spring 1967) subroutines 


Procedur: 
е. Іп the latest version of the comp £ DE 
the о. prey are the appropriate ones for the Weber function 1n ЕМІ. Copies 0 

roved program may be obtained from the author. 

£ i t 

dig response to Gregson and Russell's final statement: If McGee is m 
Validity ° [generating] axioms . . . his model could still be elastic and have m 

» I believe this generalization above satisfies their suggestion. 
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i-: riterion Р А cation 
с o p = of the chi-square criterion has ranged from š а 
of the work criterion as a chi-square test is wrong... the iin d nar слей Біне 
as a statistic ’ to * the use of the chi-square distribution seems very clev s o n 
only it should be labelled an approximation '. In addition to prn n "y durius (1965, 
insisted from the beginning that the degrees. of freedom, as in po еке 
1966 а), аге computed incorrectly. Му original formulation state Я 


fit, 
k k—r (10) 
d.f. for r-space fit — ( 5 ) = ( к n 


Carroll argues that the formula should be: 


kor (11) 
d.f. for r-space fit = ( я š 
. b 2 
In our more recent work we have decided that the appropriate formula should be 
k (12) 
d.f. for r-space fit = ( " ) š 


Тһеге is a problem concerning how 
the one hand, when discussing the Е 
a set of dissimilarities, (8) ; (ii) 


: ó " A iew we 
ij} in the r-space solution. From this point of v 


bers. 
ij} to be two independent sets of ere 
о make the d;;'s fit the djs may then be interpreted as a c 


eedom given by eqn. (12). 


t ransforms, 
Subroutines to find the best-fitting monotone trans 
(dj). From this point of vi i 


| 3 procedure ties the d;;’s and the djs together, so that they 
not independent, but When it comes to specifyi 


à à dence 
: | ng the shape of this functional ig an 
the analysis becomes intractable. In the current State of confusion I reject eqn. ( users 
hold in abeyance the choice between eqns. (11) and (12). As a matter of fact, most 
of EMD do not use the chi-square criterion in any explicit manner. 
Prognosis 

There is no doubt further room for improvement in t 
mple, the use of 


examp > of normal distributions is (to 
especially when it is real; 


ealize В ч 
of “ceiling” (or “floor”. ; d that negative dista 
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COMPUTER SECTION 


APL : A COMPUTING LANGUAGE DESIGNED FOR THE USER 


Ву S. Hunxa 
University of Alberta 


Traditionally computer languages have been thought of only in the context 
of communications with a computer. Most languages in common use today are still 
more dependent upon machine design characteristics than the needs of the user. 
APL, designed by K. E. Iverson, is a new language equally useful for purposes of 
communicating with a computer as it is for use in the classroom. This powerful 
language can be learned easily and applied to a wide range of problems. 


1. INTRODUCTION 


Because of the widespread need for students and researchers to solve their 
Tesearch problems with the assistance of a computer, it has been usual in the 
last decade to include computer programming as part of the university 
Curriculum. In North America the programming language has typically been 
FORTRAN when the objective was to provide the student with a tool for solving 
Problems and not an understanding of the internal operation of the computer 
Itself. 

Existing lan es, although having m 
Codings өші а5 МАР dF AP, still haya one major disadvantage when used 
In the educational setting or where the computer is used ав ап aid to the solution 
of problems provided within the classroom. One of the major ler ae 
With the length of time needed to instruct a researcher in the language, an ir. 
When the language is learned, frequently the reseätcher apendi ШИ que 

debugging’ than upon understanding the concept ан a pio Hi 
further criticism of existing computing languages 15 that m d. for the 
themselves to use in the classroom, particularly on the blackboard, 
scription of algorithms. ; : ; been 
ad Iverson (1962, 1966) has described a mathematical -— s Eis the 
cl Pted for computer usuage, and which can easily be x c. known ав APL, 
i "Ssroom quite independently of a computer. The language, 
° also Perfectly suited for the description of арбыйт. 
Spec; major advantage of APL resides in the fact pm меу 
е fied is that which is customarily used in logic ей. AS 
consid’ for the user to master new symbols is D ise ind ai 
ег the simple problem which requires the addi 


е el Y 
ments of a vector of order z. 


arked advantages over symbolic 


h of the symbolism 
ematics. Thus 
an illustration, 
ultiplication о 
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Arithmetic Notation APL Notation FORTRAN Notation 
Ex Tx SUM = 0 
DO 1 J = TN 
1 SUM = SUM+X(J) 
Пх; x [x SUM = X(1) 
DO 1 J = 2,N 


1 SUM = SUM#*X(J) 


The similarity between the Iverson notation and usual mathematical 
notation is striking. In fact, one might argue that the arithmetic notation is 
inadequate relative to that of Iverson because the Z and II do not convey to 
the reader the basic difference between the two arithmetic operations, i.e. that 
one involves an addition while the other a multiplication. Since the addition 
and multiplication operators are included as part of the Iverson notation for 
these operations, a clearer Specification is made. In contrast to FORTRAN, 
the Iverson notation is obviously more meaningful and simpler. 

All of the examples which follow have been run on an IBM 360/67 computer 
located at the University of Alberta. The interpreter used with this machine 
was provided through the courtesy of the 'T. J. Watson Research Laboratories, 
Yorktown Heights, New York. It should be emphasized that APL is being 
used with an interpreter rather than with a compiler. With the interpreter 
there is immediate execution of the instructions and the results are then available 
for display to the user. In our particular System, IBM 2741 terminals are use 
em. One marked advantage being realized by the use 
Tors are immediately made evident by the system, an 
with the computer in a ‘ conversational ' mode. 


chy of operators, there is a right-to-left precedence 
erator acts on 


d of the statement. When an operator requires parameters 


scalars, x; vectors, 
except in specific examples where 


Шынылы, ТТТ. 
(00 А PPUen ЖА Оза [| vas ITC )e»nuiT| 5: 
Ficure 1. А typical 1 і 
рарег, statements ТҰТ ессе Set appearing on a computer print-out. In this 


il А > PT itals 
replaced by ordinary ета use the same notation but with the italic сарі 
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Arithmetic Operations Р 
The following symbols and their arithmetic operators аге used іп APL: 
T (addition), x (multiplication), + (division), — (subtraction), ж (exponentia- 
ton). It should be noted that, with the exception of exponentiation, these 
symbols are identical to those usually defined. They differ from FORTRAN 
notation in that + is used for division, x for multiplication and ж for 
exponentiation. 


Specification and Display 
In FORTRAN the — symbol is used to indicate the operation that the 
variable on the left is replaced by the result found by carrying out the operations 
on the right. Output operations in FORTRAN may be specified by a WRITE 
Statement followed by the logical device number and a format specification 
number. Unless output is to be binary, the format must be included in 
FORTRAN. In APL the symbol equivalent to the — is the left-pointing 
arrow (<), and output is indicated by the symbol О. Using Iverson's notation 
we have the following: 
Specify < 
Display O 
on is extremely simple relative to FORTRAN, 
some computer users might 


Although the display notati 
output that 
n acceptable 


it does not permit the versatility in : 
desire. It is, however, relatively simple to arrange output in a 
manner. Display is assumed if no variable name is specified. 


Reduction Operations | | 
The operations indicated above, where we contrast stent ae 
and product operations as they would be used in FORTRAN m _ d S 
Teferred to as reduction operations. Any binary operator or rela xx ee 
extended over vectors and matrices. This is a very useful and e peas 
Sion not easily handled in other computing languages. ae ux Е и 
type to denote scalars, vectors and matrices, We define the reduction ор 
as follows: 


(а) Reduction over a vector: 


у-0/х 
y =X, © % O 13 Q x, ... O Ха 


(8) Row reduction on a matrix X: 


у-0/Х 
y=% Ө 5% © ха O Nae © Xin 


(с) Column reduction on matrix X: 
y<@/|[1]X 


" OL 
yk =X Ө Xok Q жак Ө Xyk 
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(4) Matrix product: 
C~AO,.0.B 
Gr =Q,/a;.O2b., (assumed over all j and k) 


zs > operator 
In the definitions given above, the symbol 9 represents any yea А ratios 
or relation. A few examples will serve to illustrate the advantages nd matrices in 
given above. (No distinction is made between scalars, vectors ar ing notes.) 
our examples since the distinction will be clear in the accompany 


APL Statement 
Notes | келен 
Sum a vector of elements in A 

and specify as Х Hex /A 
Find the product of the elements in vector А 

and specify as H C«—A4.xB 
Matrix multiplication of A times B 

and specify as C U= JÀ 
Generate a 1 when successive elements of А, 


beginning at the right, are equal 


F urthermore, th 


tan 
A NT ee : ; unders 

6 matrix multiplication statement is not difficult to 

‘since it indicate: 


5 that first a multiplication The result 19 

nd the corresponding elements in a column of B. d columns: 

then Summed, and the whole operation repeated over all rows an vd 

f the matrix Product, a useful variant is the null paa ed at ? 

by the symbol о, is symbol indicates that no operation is de o colum? 
i For example, if x and y are tw 

“would be specified as: 


Concatenation 
Very frequently it is re 


to form new vectors, 
concatenated as illustra 


пй” 
‘joine 
quired that a vector or a set of scalars = | 
In this Instance the comma will cause уап 

ted below: 


Unya d 


n 

уа 

ctor J 

as been concatenated to the v not 
+ 


ж”. 
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Rank and Restructuring Operations 


ia i = m necessary to determine the number of elements in a vector or 

of a matrix, or to rearrange the elements of a vector to form a matrix. 
For such operations a monadic (single argument) rank operator and a dyadic 
(two argument) restructuring operator have been defined. 


(а) Monadic Rank Operator 
Тһе single argument rank operator is defined in the following manner: 


If x=, No Vg 5... Ха 
then л, the order of the vector, is given by 

| рх, 
and if X =X; x; X3 X, ... Xc 


then z,c is given by 
pX 


As an example of the use of the rank operator, the arithmetic mean 
be stated in APL notation as illustrated in the following example. 
APL Statement 


would 


Notes 


Specify X as the vector of data Xei2345 2 
Divide all elements of X by the number of MEAN-—- /X+pX 
observations; sum the result 
[MEAN 


Display the mean 


It should be evident that in this APL interpreter no DIMENSION statements 
аге required, and that ро loops do not appear where they would not usually be 
expected. 


(0) Dyadic Restructuring Operator f 
tructured by using the 


A vector of i licated or res 

7 elements may easily be replicate uctu 

dyad ІС p operator. "Тһе left-hand argument may contain either d z 
indicate the number of times the right-hand argument is to be rep «e bis 
two scalars indicating that the right-hand argument is to be restructured to 1o 


а matrix of the order defined by the left-hand argument. 
APL Statement 


Notes Result 5 
Specify X as 5 устаз 
Specify Y as the vector 1, 2, 3 EP 
isplay X replicated twice 1—(3,3)pY 


55 
Display Y restructured as three rows and 1 3 4 
and three columns i 23 


[1 ,3)p1,0,0,0 


Display as three rows and three columns 100 
the vector 1, 0, 0, 0 010 
001 S.P. 


254 5. Hunka 


In the last example, restructuring the vector proceeds in such а manner 
that elements not forming one line are used to form the next. 


Interval Vector Generation and Index Operations 
It is very useful to be able to generate a vector such that 
12, ут 
This vector would be useful for generating subscripts, operations involving 
factorials, although in APL a factorial operator has been defined, and where a 
function requires evaluation between some sets of limits. "The operator is 
used for this purpose. We may define this operation as follows: 


x< un 
X=1,2, 3, ...,n. 


Notes Result APL Statement 
Specify the vector X as 12345 X5 
Specify a vector as 1 2 3 4 5 to 72-1012 4—73 $05 


which minus 3 has been added 


If a student wished to evaluate a 


I function such as x? for values between minus 
2 and plus 2 he might proceed i 


n the following manner: 
(C3 +15) #2 
If he wished to find 50! the solution is as simple as typing 


Vector and Matrix Subscripts 


In all the examples : Б 
А to this poi A sed 
without reference to subscri point, vectors and matrices have been u 


) : 5 
pts. When required, subscripts for either vector 
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E or matrices i 
4. D = defined by enclosing them in square brackets. The delimiter 
tae nd column subscripts is the semicolon. The subscript notation 
erencing a complete row or column of a matrix. In addition, APL 
, 


permits subscripts to be vectors. 


" š Notes Result APL Statement 
M Specify a matrix M -—— 
rie element 7714 2 мп 2l )p 456789 
isplay the second column 258 M[;2J 
36 M[1,2;3] 


Display elements ms and ms 


Floor and Ceiling Operations 
or L. may be defined as follows: 


ke Lx 


'The floor operat 


Such that 
k<x<k+1. 


If the vector x replaces the scalar x the result k is a vector containing the floor 


elements of x. 16, however, we use 
ke L. |x 


is specified as k. 


be defined іп а similar manner. In the case of 


then the smallest element of x 
'The ceiling operator can 


-^ the ceiling operator Г, 


қ АГ х 
will result іп # being defined as 
Һ-1<х<Е. 
If we use the vector x, then 
ВГ [x 
will specify Ё as the largest element of the vector X. 
ады 
Notes Result APL Statement 
Specify a vector x X<2.6,3.1,6.2 
Display the floor of X 236 LX 
Display the minimum element of X 2.6 L/X 
Display the ceiling of X 347 rx 
Display the maximum element of X 6.2 rx 
' Compression Operation 
e certain elements from a vector 


s required to remov 
ements. This operation removes elements 


If a binary vector u is defined as containing 
g to the position from which an element 


Жид Уегу frequently it is requ 
апа to ‘compress’ the remaining el 
ao changes the order of the vector. > 

in the index position correspondin: 
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š Р T T€ 
is to be removed from a second vector, and 1’s in the index оо үс 
elements are to be retained, we may define the compression operation as follows: 
z<—u/x 


24% Я x 
and the result will produce a vector z containing only those elements of 
which were not suppressed by the 05 in u. 


Notes Result APL Statement 
Specify the vector X X—2,8,4,3,9 
Suppress even elements of the vector X 2492 (1,0,1,0,1)/X 


Factorial Operation 


The factorial operator is identical in APL notation to that usually те” 
mathematics. Its use in the System requires that the operator appear to 
left of the scalar (or vector) on which it Operates. For example, 


FACT< (150) 125 
of 50! divided by 251. Since this is a very large 
reter displays the result in exponential format. 


will specify FACT as the value 
number, our particular interp 


Relation Operations 


All of the relations < S = > > and së in addition to the logical operators 
V and A are available, Results produced are binary, i.e. a 1 if the condition is 
dition is false. These Operators are very important fat 
determining the conditi ing and for certain tabulation operations. 
Our example below show. le frequency tabulation may be carried 
out. The use of these о 


; à the 
: ranching statements is reserved for 
next section, 


5 how a simp 
perators in b 


Notes APL Statement 
Specify vector X<3,2,4,6,2,2,1,4,6,5 
abulate 13412135 FREQ-— + /(16)o.— X 


Result 


FREQ, in the above exam 
the elements o 


f the vector 


а А san of 
Ple, will contain the correct frequency tabulation 
will generate th 


с о = X, 
orx. The right-hand portion of the statement, (46) 
е following matrix of 1’s and 0%: 


X 


5 

[E № 
орноо o 
ооооно N 
Фонњоро + 
моәоооо a 
ooooro кюю 
oooo-o mw 
ooccoe н 
ec = cocoes + 
њооооо o 
O^Oococo u 


1 
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Тһе final i i i 

b cde ue Van eene ns н. Operation (+/) over the rows of the matrix, 

ae rie ш the frequencies 1, 3, 1, 2, 1,2. It isa very twr 

sha ерлердікі is the maximum and minimum values and then to generate 

ря categorization when the numbers to be tabulated do not 
nuous sequence beginning at one. It should be noticed that the 


right-han e i 
d statement producing the matrix to be tabulated by row, is of the 


general matri Y t: < 
1х produ i i i 
р duct form. In this sense, Iverson’s notation is markedly 


Sume Sx de é 
perior to existing mathematical notation. 


Function Definition and Branching 
"Hos APL requires an interpreter and executes each APL statement 
indi e iately, the statement itself is lost unless special instructions are given 
icating that a set of statements is to be saved. Therefore branch statements 


ca eua S 
n only be used within a set of APL statements which have been specifically 
ion even after execution. Readers will be 


labe EA 
"me^ un as requiring complete retenti 
amiliar with this concept since it is essentially the same as functions in other 
Programming languages. 
! PSI "P š š 
Тһе specification of a function 18 made in the following manner: 


Re Al NAME A2 
name NAME and takes in two 
vectors or 


In this instance the function is called by the 
as the first 


arguments specified as Al and А2. Al and A2 may be scalars, 
matrices. "The result is returned іп К. The sumbol 57 must appear 


and last symbol to specify that a function is being defined. 
d Branching statements can only be made within functio 
efinition of a function, a statement is lost upon execution. 
of a branch statement is as follows: 
-(х R y)/s 
atement if the relation specified by R 
the next statement’. For example, in 


ns since without a 
The general form 


d as ‘go to the sth st 


This is interprete 
otherwise go to 


between « and y is true, 
the statement 

-(5- CTR)/7 

if CTR was equal to 5 the seventh statement would be execut 
next statement would be executed. Arithmetic expressions are 
the parentheses. If, in the follow s a vector, then 

>(1< + 10=х)/4 

е executed if х contained 
lace of the nume 


ed, otherwise the 
permitted within 
LJ 


ing example, x i 


t to b one or more 
ric ones used 


W 
ould cause the fourth statemen И 
Iso be used in р 


zeros, S uices 
i ymbolic addresses тауа 


in 
Our examples. 
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3. АРІ, PROGRAMMING EXAMPLES 
Example 1: Sign Test 


РЕОВ-«-Х ANSIG Y 
[1] Шы. Li(+ /0<X-Y),+/0>X-y 
[2] N—(pX)- + оа 
<—(\PND +1)— uM 
B е н /((IN)+(R) x (N—R)) х (0.56) x 0.SsN— R 
[5] PND,N 
[6] V 


The following explanation is given for each statement number: — 
A the s 
[1] Find the number of positive and negative differences and specify PND as th 
of the two. 
[2] Specify N as the numbe 
X and Y. 
[3] Specify R as a veci 
(N-R)!)PRON-R 
[4] Calculate the probabilities and sum. 3 Я leted. 
[5] Display PND and N. PROB is displayed upon the function being comp 


en 
i ces betwe 
r of elements in X less the number of zero differen 


А ПЕ! 
: Р қ А ation (М! 
tor of event categories as used in the binomial equ 


Example 2: Kruskal-Wallis Analysis of Variance 
VH—NG KWA X 

[1] RO-—RANK X 

[2] бе 

[3] СОМ--0 

[4] K—1 

[5] GHG, + /RO[CON + ((NGIK))} 

[6] —+(K=pNG)/10 
[7] CON+-NG[K]+CON 

K<—K+1 

[9] —5 
[10] 52 ((RO)2) + RO) X (+ (G«2)— 
VR—RANK X 


D] XRO-XIS--Ck [Gto s X) 4 Gto cg Х)». «ipX))upX] 
Б] RT + ERO XR) ЛХ)». Sepp 


NG))-3 x (pRO)+1 


. NG 

The calling Statement of the Program is specified in the statement s 

KWAX. They turned by the program while the ene ді res 
nction X is à Vector containing the observe 


tor 
e vec 

re contained within the data defined by th Is the 
The function Kwa calls up when 
ector of observations X. 


_——— —„„——— 
— &— ч 

—,coTJ,c-_—əƏəÀ—ə— ə>——.>—.—DK. 

P 


APL: А Computing Language Designed for the User 259 


this is com i Tn W. 
pleted the vector is returned as 
) RO and KWA then procee 
the appropriate summed ranks for each group. ac 


Example 3: Correlations 
V CORREL 
‘THIS FUNCTION WILL AC 
CEPT BY RO Y 
" VARIABLE SCORE MATRIX' айын” 
n ae TYPE THE NUMBER OF VARIABLES NEXT’ 
[4] R-—(NV,NV)p0 
[5] ‘NOW TYPE THE NUMBER OF OBSERVATIONS’ 
[6] М0 
[7] ‘NOW ТҮРЕ THE VARIABLES FOR EACH OBSERVATION. WHEN 
FINISHED TYPE -99.” 
[8] M—NVp0 
[9] 5--МУр0 
[10] —12 
[11] “ERROR RETYPE LAST LINE----- LAST LINE IGNORED‘ 
[12] x-0 
[13] —(12 + /[-99- Х)/ЕМЮ 
[14] —(NVzpX)/11 
[15] ReR+Xo.xX 
[16] S——S.-X«2 


[17] M—M-4X 
[18] —12 

[19] END: ‘MEANS’ 
[20] nm 


[21] []--M-—M-—N 
[22] 'STANDARD DEVIATIONS' 


[24] S—((S—N)— M«2)40.5 
S 


[26] ‘CORRELATIONS’ 


[27] 
[28] R-—((R--N)- Me. x M)—So.x S 


[29] R«—(L.100 x R + 0.005) +100 
[30] V 


Titles whic 
Data to be accepted while on- 


Specified as Г]. Statement num 
termination of data input is made. 


been input, and if it has, 
Checks to determine whether the correct number of variables have been input. 
If an incorrect number is sensed, control is transferred to an error message and 

Statement 15 calculates the matrix of 


a request is made to retype the last line. 
Sums of squares and cross-products and accumulates the results over all observa- 


tions in the matrix R. Statement 28 is the matrix equivalent of 
rje = (Qo |N)— у ЈА (6056). 


Statement 29 rounds-off the correlation coefficients to two digits. 


h are to be displayed are enclosed in single quotation marks. 
line are indicated by a variable name being 


ber 13 checks to determine whether —99 has 
Statement 14 
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Example 4: Computer Assisted Instruction 


The power and versatility of APL is not limited to numerical a 
as the example below will illustrate. Although the use of the via ped ive 
direct teaching aid is not new, tremendous problems exist in retraining = a 
matter specialists to program for the computer at least some portion o in 
course work. APL appears sufficiently easy to learn, relative to such languag 4 
as the IBM Coursewriter II, that the initial success so important to d 
course author prepare material for computer instruction, is easily a s 
The example given below is intended to show how alphabetic information e 
be handled in the АРТ, System. The final result of the program is similar 
the result provided by the editing function of Coursewriter II. 


VFC-—CHECK ANS 

[1] ANSC-—'ANAPLASTIC CARCINOMA" 
D] ANSR—'CARCINOMA ANAPLASTIC’ 
[3] CTRC.—.- /ANSC- ANS 

[4] CTRR—.- /ANSR— ANS 

D] —(CTRC- CTRR)/S 


[6] FCO CARCINOMA ANAPLAST 


ICT1-- ((ipANSR) x ANSR = ANS] 
[| 20 
М FC — ANAPLASTIC CARCINOMA [1+ (pANSC) x ANSC — ANS] 
3] V 
If the student Tesponds with the words ANAPLASTIC CARCINOMA ОГ 
CARCINO 


MA ANAPLASTIC and one 
function CHECK, 


responded with, Say, ANEPLISTIC р 


STIC -ARCINOM-, Тһе power of 
and 7 in the above example. 

marks are stored by the system a 
that alphanumeric vectors may 


: -PL- 
ARCINOME the function would return AN ud 
APL is of course embodied in сарман" 
Characters enclosed between single quot 


. ed 
5 alphanumeric characters. Tt should be notice 
be subscripted, 


4. CONCLUSION 
There is no 


: question in the mind of the writer that APL, as devised by 
Iverson, is a language that should most certainly receive considerable attention 
of not only researchers with Specialized mathematical problems, but also by 
teachers at all levels of instruction. Тһе main advantages of APL rest i" 
precise and meaningful symbolism, and its applicability in the classroom as WC 

as with the computer, There would also appear to be merit in the use of 


. . er- 
for Purposes of mathematics Instruction and the construction of comput 
assisted teaching Programs, 


REFERENCES 
Iverson, K. Е. (1962). 4 Programmi 
Iverson, К. Е. (1966), Elen; "ng Lan 


"guage. New York : Wiley. 
nentary оз 14 ew Yor! y 


. г jates- 
"5. Chicago: Science Research Associ? 
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COMPUTER PROGRAM ABSTRACTS 


Text Sorting Progr: ive 
g Programs. By 
е, g: By J. R. Бінмік, Department of Education, University of 
Descriptio 1 7 
n (1742 Sorting Big A program designed to scan text and abstract details of: 


sentence length, 


2. iffere: r i h 
diffe nt words used in eac. sentence, 
8; numbe; i i Ve М 
rs of linkers operating at clause level and above (linker name output) in 


each sentence, 
k š š 
numbers of free linkers (linker name output) in each sentence, 


(optional) errors in sentence, 
a : 
umulative details of 1, 2, 5 at the end of each text in terms of errored and error- 


ons 


free text, 
7. ordered word-frequency list. 


A an program will handle texts containing up to 2,000 different orthographical words. 
ionary of linkers is read in as data tape 1. Errors must be numbered by type in the 
ill be assumed that all text is error-free. 


text at the data-preparation stage, otherwise it wi 
UA4 Sorting As above but facilities 1, 2, 7 and cumulative details of 1 and 2 only available 


for 1 number of texts. 
Тһе program will handle texts containing up to 500 different words in batches of four 


(more if they contain fewer words). 


ums Language Both programs are available in 
9, A version exists of UA2 suitable for running o 


mended as output (paper tape) is slow. 


ALGOL suitable for running on the 
n the Elliott 803B but is not recom- 


DFO9 is from a line- 


aper tape, output on the К. 
capitals are ignored, therefore lower- 


Data Format, Output Input is eight-hole p: 
Printer with a pre-set format. Words beginning with 
A title for each text is allowed 


а characters must be used at the beginning of sentences. 

and this title will be output at the head of the appropriate set of results. It must be enclosed 
by exponential signs. Each text must be terminated with a multiplication sign and the 
final character read must be the normal end message character. 


Computation Times UA2 deals with approximately 100 words per minute; UA4 about 
five times as many. 
аға preparation notes, аге avail- 


ta tape, together with d 
University of Durham. 


Availability Program tapes and da 

able from the Research Secretary, Department of Education, 
Reference 

Barent, М. R. (1959). Linkage. (Unpublished M.A. thesis, University of Leeds.) 
ons or Balanced 


ired Comparis 
Department of 


Slater's ; [and nearest adjoining orders] from Pa 
Incomplete Block Design Rankings. By J. Р. N. PHILLIPS, 
Psychology, University of Hull. 
a a set of paired comparisons ог, 
of preference rankings 
riptions]. It 
tput and no 


takes as dat 


d Tucker (1961), a set 
d a list of object desc! 


diagnosis of each is ош 


Deserta: 
scription Program A, the full program, 


following Schucker (1959) and Gulliksen ап 
rranged in a balanced incomplete block design) [an 


c š 
hecks the data for obvious errors: if any are found, a 
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Example 4: Computer Assisted Instruction 


The power and versatility of APL is not limited to nunierical calculations, 
as the example below will illustrate. Although the use of the computer as a 
direct teaching aid is not new, tremendous problems exist in retraining subject- 
matter specialists to program for the computer at least some portion of their 
course work. APL appears sufficiently easy to learn, relativ 
as the IBM Coursewriter П, that the initial success so im 
course author prepare material for computer instruction 
The example given below is intended to show how alphabetic information can 
be handled in the APL system. The final result of the program is similar to 
the result provided by the editing function of Coursewriter 1I. 


е to such languages 
portant to having a 
, is easily obtained. 


V ЕС--СНЕСК ANS 
[1] ANSC.—ANAPLASTIC CARCINOMA’ 
[2] ANSR-—CARCINOMA ANAPLASTIC’ 
[3] СТКС«- + /ANSC- ANS 
[4] CTRR—+/ANSR= ANS 
[5] —(CTRC- CTRR)/8 


FC——— CARCINOMA ANAPLASTIC 1+ (tpANSR) x ANSR = ANS] 
—0 


[8] FC——— ANAPLASTIC CARCINOMA'[1-- (pANSC) x ANSC = ANS] 
91 v 


If the student responds with the words ANAPLASTIC CARCINOMA or 
CARCINOMA ANAPLASTIC and one of these ans 


1 wers is conveyed to the checking 
function CHECK, the correct ans If, however, the student 
responded with, say, АМЕ 

STIC -ARCINOM-. The 


1 between single quotation 
s alphanumeric characters. It should be noticed 
ectors may be Subscripted, 


4. CONCLUSION 
There is no question in 


ә һе writer Н 
Iverson, is a language that should Most cert: ep APL, as dev 
of not only researchers with 


There would also 
for purposes of mathematic. 


|. S instruction 
assisted teaching programs, 


REFERENCES 


ing Language, New Y SESS 
Functions, Chi У Хонг: Wiley, 


Саво: Science Research Associates 


Iverson, К. Е. (1962). 


A Programm 
Iverson, К. Е. (1966). 


Elementary 


Tl 
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COMPUTER PROGRAM ABSTRACTS 


Text Sorting Programs. By J. R. BIRNI, Department of Education, University of 


Durham. 
Description UA2 Sorting Бір А program designed to scan text and abstract details of: 
1. sentence length, 
2. different words used in each sentence, 
3. numbers of linkers operating at clause level and above (linker name output) in 
each sentence, 
numbers of free linkers (linker name output) in each sentence, 


4. 
5. (optional) errors in sentence, 
6. cumulative details of 1, 2, 5 at the end of each text in terms of errored and error- 


free text, 
7. ordered word-frequency list. 


The program will handle texts containing up to 2,000 different orthographical words. 
A dictionary of linkers is read in ав data tape 1. Errors must be numbered by type in the 
text at the data-preparation stage, otherwise it will be assumed that all text is error-free. 


UA4 Sorting As above but facilities 1, 2, 7 and cumulative details of 1 and 2 only available 
for n number of texts. 

Тһе program will handle texts containing up to 500 different words in batches of four 
(more if they contain fewer words). 


Computers, Language Both programs are available in ALGOL suitable for running on the 
KDF9. A version exists of UA2 suitable for running on the Elliott 803B but is not recom- 
mended as output (paper tape) is slow. 


Data Format, Output Input is eight-hole paper tape, output on the KDF9 is from a line- 
printer with a pre-set format. Words beginning with capitals are ignored, therefore lower- 
case characters must be used at the beginning of sentences. A title for each text is allowed 
and this title will be output at the head of the appropriate set of results. It must be enclosed 
by exponential signs. Each text must be terminated with a multiplication sign and the 
final character read must be the normal end message character. 


Computation Times UA2 deals with approximately 100 words per minute; UA4 about 
five times as many. 


Availability Program tapes and data tape, together with data preparation notes, are avail- 
able from the Research Secretary, Department of Education, University of Durham. 


Reference 
Barent, М. R. (1959). Linkage. (Unpublished M.A. thesis, University of Leeds.) 


Slater’s і [and nearest adjoining orders] from Paired Comparisons or Balanced 
Incomplete Block Design Rankings. By J. P. N. Рниллрѕ, Department of 
Psychology, University of Hull. 


Description Program A, the full program, takes as data a set of paired comparisons or, 
following Schucker (1959) and Gulliksen and "Tucker (1961), a set of preference rankings 
(arranged in a balanced incomplete block design) [and a list of object descriptions]. It 
checks the data for obvious errors: if any are found, a diagnosis of each is output and no 


262 Computer Program Abstracts 


urther апа! е! еа. erwise umber 
hi t termines and outputs 2, the minimum n 

f lysis attempted. Oth it ае 

of inconsistencies in the data [and the object descriptions in all nearest adjoining orders 
obtained by rectifying the inconsistencies (Slater, 1961), using a procedure which is most 


A. Т 
fficient with relatively consistent data (Phillips, 1967). Тһе shorter Program B does no 
cum out the operations listed in square brackets. 


Language Elliot 803 ALGOL. 


Data Format Тһе data layout required is an almost exact Copy of a convenient form of 
experimental protocol. 


Space Required and Computation Time Bothare effectively unpredictable, since the program 


uses a recursive procedure, which is re-entered an unpredictable number of times, each 


time setting up arrays of unpredictable dimensions: however, both will be negati 


vely 
correlated with the consistency of the data 


(i) The print-out of an ALGOL comment, giving full instructions for use of the 
Program, with examples, 

(ii) The print-out of th 

(ii) The Elliot 803 five 

(iv) The Elliot 803 fast 


For users with access to an Elliot 803 with 8,192 Word-store, (i) and (iv) will probably be 
most convenient. For those wj i ton which accepts Elliot 803 ALGOL 
in 5-hole tape, (iii) will be applicable. For all other users, (ii) is most convenient, 
References 


GULLIKSEN, Н. and TUCKER, 


j R А general Procedure for obtaining paired 
comparisons from multipli Я Psychometrika 26, 173-183, 
PHILLIPs, J. P, N. (1967). A Procedure for de 


termining Slater's ; 1 joini 
orders. Br. J. math. statist, Psychol. 29, 217-225. “008 and all nearest adjoining 
Ѕснискев, R. E, (1959), 


А note on the use of triads 
metrika 24, 273-276. 
SLATER, P. (1961). Inconsistencies in a schedule of 
303-312. 
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BOOK REVIEWS 


Advanced Theory of Statistics. Vol. ПІ. By М. С. KENDALL and A. Stuart. London: 
Charles Griffin & Со. Pp. 552. 136s. 


This is the last volume of the revision of M. G. Kendall’s two-volume Advanced 
Theory of Statistics (1947-48). The original was initially planned in synopsis before the 
late war by a group of leading statisticians, but circumstances left it to Dr. Kendall to 
compile. ‘This was fortunate in that the outcome was a book with the unity of style given 
by single authorship yet with the encyclopaedic compass which has established it as the 
standard reference book of statistical theory. During the following two decades, there has 
been an immense outpouring of papers which have greatly widened and deepened our 
knowledge of statistical theory as it was in 1947 without substantially adding any new major 
statistical ideas. There have been, it is true, various all-embracing schemes: Decision 
theory, Information theory, Games theory, Neo-Bayesian theory, etc., which, while throwing 
new light on existing practice, have expired or are expiring without substantially altering 
that practice. Perhaps Sequential Analysis, with its peculiar aptness in clinical trials, is 
the only new idea contributed to statistical methodology which has found a permanent 
niche. But it was really a war-time development. This third volume completes the 
formidable task which Dr. Kendall took on, with Professor Stuart in partnership, of building 
the statistical part of this huge corpus of papers into the original scheme. 

The book itself is, in essence, four separate books in one binding. That is, it consists 
of four distinct sections, each substantially self-contained, although the first leans somewhat 
on the theory of least squares developed in Volume II. They are: Analysis of Variance 
(pp. 1-165), Survey Theory (pp. 166-238), Multivariate Theory (pp. 239-365) and Time 
Series Analysis (pp. 366-503). On stylistic grounds one may guess that the first two 
sections owe more to Professor Stuart and that the last two are largely Dr. Kendall’s. The 
former are distinguished by a fairly exhaustive coverage of the literature and tend as a 
consequence to have many of the properties, both good and less good, of a review. The 
great advantage of this form is particularly seen in the first section where, after giving a 
more rounded analytical general theory than was possible in 1948 and treating fully the 
components-of-variance model which now has a fairly well developed theory, space is 
found for the ragged edges of the Analysis of Variance where the present state of knowledge 
is inconclusive or altogether less cut and dried. This miscellany of topics includes notably: 
multiple comparisons and questions of design; questions of relaxation of the standard 
assumptions and techniques of transformation of variables intended to make these hold more 
nearly. 

The section on survey theory is an excellent compression of the often intricate though 
elementary sampling theory in this field. It also illustrates one drawback of the review 
approach, namely the very occasional lapse into uncritical reporting. For example, a page 
is devoted to a précis of Pathak’s nebulous * definition ’ of Sufficiency in finite population 
sampling. This is an attempt to generalize a valuable insight, and the application given 
here illustrates this insight but does not test the wider aspects of the general definition, so 
that the example tells us more than the (redundant) “ definition °. 

Both these two sections give a good guide to the present state of the theory, which is 
less true of the latter two whose virtue is their more organic unity. This is doubtless 
partly due to the fact that they are direct expansions of the corresponding sections in Dr. 
Kendall’s original book and partly perhaps to the more unitary nature of their subject 


matter, but the approach in these sections is noticeably more selective. 
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The section on multivariate analysis is possibly the best concise кусу 
basic distribution theory that exists. It is very strongly to be 2- со 
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criminant analysis (namely that the individual to be classified belongs to none of the 
populations in question) is not included in б 


he discussion of contingencies. Ve a 
The time series analysis here, as in the original Volume II, has the unique distinction 
of giving an adequate exposition of the mathematical theory yet remaining fundamentally 
statistical, which is to say relevant to practice and applicable, It is a pleasant change from 
the usual dreary exercise in formal symbol pushing in pursuit of unrealistic models. Some 
average methods beloved of old-school economists still 
d have been further tidied up and tied in with modern 


istic and parameter, in places would 
have made the argument clearer, And one would have supposed that Laplace’s Théorie 


Analytique would have been sufficiently familiar to the authors to dispel their belief that 
saddle-point methods were first Systematically introduced into statistics 144 years after that 


book was written. But all these are negligible criticisms when compared with the major 
virtue of this section, 
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I would have liked to have seen some mention of the use of scores other than integer ranks, 
and more comments on the somewhat embarrassing multiplicity of non-parametric methods 
that are available for any of the simple problems discussed. It is a pity also that no mention 
is made of permutation tests using the sample values themselves since, in certain circum- 
stances, these techniques can throw considerable light on the ‘ normal-theory ' methods. 

Professor Noether has a pleasant style of presentation and the mathematical level will 
certainly not deter any of his intended audience. The book certainly contains material and 
emphases not to be found in other texts on non-parametric methods. For instance, he 
includes a quite detailed examination of the standard methods when the underlying random 
variable is discrete; and of the closely connected problem of ties. Also, the degree of 
emphasis placed on distribution-free interval estimation is unusual and pleasing; and the 
same remark applies to his integrated development of the Rank Sum Test. 

In conclusion, I would express the hope that following this pleasant aperitif, Professor 
Noether will see fit to write a much extended textbook on his subject directed to the same 
audience. His efforts would almost certainly be appreciated by both teachers and students 
of statistics. J. С. FRYER. 


Principles of Psychological Measurement. Ву С. С. HELMSTADTER. London: 
Methuen. 1966. Pp. хх+ 248. 36s. 


This is an attempt to present, in a reasonably compact form, the present position in 
the theory related to this rapidly developing field. Some mathematical derivations of an 
elementary character are given, but, on the whole, the reader requires only a knowledge of 
descriptive statistical techniques and some familiarity with conventional test materials. It 
includes nine chapters of theory and a final overview. 

The concepts of standardization, objectivity, reliability and validity have been presented 
within a unifying framework of error types; for example, standardization under ‘ inter- 
pretative’ errors, objectivity under ‘ personal’ errors, reliability under * variable’ errors, 
and validity under ‘ constant’ errors. Standardization and reliability are dealt with in 
single chapters, but there are separate chapters on content, empirical and construct validity. 

Helmstadter makes the case that the purposes of testing can all be subsumed under 
© prognosis ’, and argues that, for example, “ diagnosis is an elaborate form of prediction 
involving the anticipated consequences of suggested therapies or other contemplated 
actions ". His concentration on validation techniques is consistent with this view, and in 
keeping with present educational trends increasingly concerned with guidance and evalua- 
tion of curricula. An early introduction of expectancy tables adds clarity to the statistical 
aspects of prediction. 

Unfortunately, his explanation of some of the more complex concepts lacks crispness. 
For example, the different kinds of errors involved in test-retest and parallel-form reliability 
determinations are neither well defined nor well illustrated. Тһе chapter on ‘ Construct 
Validity ' uses the highly philosophical language of Cronbach and Meehl's original paper 
on the subject before giving an example; by contrast, Cronbach's own elementary textbook 
adopts a heuristic approach via the construct of anxiety before attempting a definition, 
which is more suited to the average student. 

Chapters on Test Development and Scales and Inventories, incorporating a good deal 
of recent work, followed by a useful summary with diagrammatic illustrations of methods 
available for prediction of multiple criteria in the penultimate chapter on Multiple Measure- 
ment, may redeem the book, particularly in the eyes of students training for guidance work. 
It should also stimulate those readers capable of becoming specialists in the measurement 
field to continue their studies by reading the more detailed treatments of, for example, 
Gulliksen and Torgerson and the many other references collected in the five-page biblio- 
graphy. D. M. PENFOLD. 


266 Book Reviews 


i i d М.Н. SUNSHINE. Syracuse: 
iased Friendship Net. Ву T.J. Fararo En ` : 
gi ssec mir Press. 1964. Pp.x--124. (No price given.) 


РЕ j 
ts to describe a population of persons in terms of ‘random nets — 
reum of mathematical models of behaviour: that is, I = е : - E 
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is about. It effectively presents a sociometric procedure which is not well € € s ee 
* tracing ' in a set of persons, and proceeds to show its relationship to random = ee 
theory. By relating bias parameters to certain simple structure statistics o “с. 
between persons, a biased net tracing formula is derived and its consequences are disc : 


This is followed by an empirical study of friendship carried out at a Junior High School 
where the children are almost entirel 


у from lower income Negro families and where there is 
a high delinquency rate. 


Systematic discrepancies were observed between the basic formula eo 
parameters and the data. This appeared to be due to the assumption that the biase пей 
contact density was constant at each step and evidence is then found that this is indeed the 
case. The trouble is, of course, that any attempt to define a net formula based upon a 
variable contact density would be difficult and would, presumably, involve the addition of 
further parameters to the model, Notwithstanding, further studies are made on sub-nets 
of the population and interesting features found. However, it is doubtful if the basic 
inadequacy of the model can be satisfactorily removed. 

Hence the value of this study depends upon whether or not it illustrates the correct 
kind of mathematical model for studies of friendship or is, at least, a useful methodological 
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The second part of the book considers the process of design verification leading to 
testing the system as a whole, surveys the processes and problems in planning performance 
evaluations, and classifies data collection methods. ‘The analysis of human factors data in 
system evaluation and the application of human factors techniques to the manufacturing 
processes for system components are described. "Тһе book ends with a brief survey of the 
problems the human engineer can expect to encounter in fulfilling his role in system develop- 
ment. 

The idea behind this book is well conceived: a thorough appraisal of the topics covered 
was needed. Yet the book is disappointing. The preface, intended to describe what is 
covered and what is excluded, is earnestly and carefully written but somehow fails to make 
quite clear either the contents or the intentions. Opening the book at random usually 
reveals a series of general statements, bordering on homilies, which are often well worth 
making but are seldom pursued or enlightened by examples. Page after page of general 
statements makes uninspired, unmemorable, and finally, dull reading. Frequently thoughts 
already expressed are rephrased and repeated, although the style of writing makes this less 
obvious than it might otherwise be. When examples are included they refer to “a hypo- 
thetical system, the Hermes space tug ”. The intention to achieve continuity and coherence 
by examples relating to a single system is not fulfilled, and examples drawn from real 
systems might have made the issues clear and permitted better illustrations. ‘The photo- 
` graphs are largely irrelevant and very badly reproduced. 

Several chapters, particularly in the second part, contain much useful material, in the 
form of distinctions between concepts, tabulations of factors relevant to a particular issue, 
classifications of criteria, and discussions of basic terms. Such material is in the main 
thorough, painstaking and practical. Unfortunately these merits which the book possesses 
tend to be overlooked because of its inadequacies. V. Davip HOPKIN. 
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